Overview

Dataset statistics

Number of variables42
Number of observations742
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory243.6 KiB
Average record size in memory336.2 B

Variable types

Numeric7
Categorical35

Alerts

Job Title has a high cardinality: 264 distinct values High cardinality
Salary Estimate has a high cardinality: 416 distinct values High cardinality
Job Description has a high cardinality: 463 distinct values High cardinality
Company Name has a high cardinality: 343 distinct values High cardinality
Location has a high cardinality: 200 distinct values High cardinality
Headquarters has a high cardinality: 198 distinct values High cardinality
Industry has a high cardinality: 60 distinct values High cardinality
Competitors has a high cardinality: 128 distinct values High cardinality
company_txt has a high cardinality: 343 distinct values High cardinality
Founded is highly correlated with Size and 4 other fieldsHigh correlation
Lower Salary is highly correlated with Industry and 3 other fieldsHigh correlation
Upper Salary is highly correlated with Industry and 5 other fieldsHigh correlation
Avg Salary(K) is highly correlated with Industry and 4 other fieldsHigh correlation
Age is highly correlated with Size and 5 other fieldsHigh correlation
spark is highly correlated with Industry and 3 other fieldsHigh correlation
keras is highly correlated with pytorch and 2 other fieldsHigh correlation
pytorch is highly correlated with keras and 2 other fieldsHigh correlation
scikit is highly correlated with keras and 2 other fieldsHigh correlation
tensor is highly correlated with keras and 2 other fieldsHigh correlation
hadoop is highly correlated with spark and 1 other fieldsHigh correlation
Employer provided is highly correlated with Rating and 3 other fieldsHigh correlation
Type of ownership is highly correlated with Industry and 3 other fieldsHigh correlation
job_title_sim is highly correlated with Industry and 10 other fieldsHigh correlation
Python is highly correlated with Industry and 3 other fieldsHigh correlation
Size is highly correlated with Rating and 7 other fieldsHigh correlation
Hourly is highly correlated with Industry and 3 other fieldsHigh correlation
Industry is highly correlated with Rating and 18 other fieldsHigh correlation
Sector is highly correlated with Rating and 13 other fieldsHigh correlation
google_an is highly correlated with Industry and 1 other fieldsHigh correlation
Job Location is highly correlated with Size and 9 other fieldsHigh correlation
sql is highly correlated with Industry and 4 other fieldsHigh correlation
Rating is highly correlated with Size and 3 other fieldsHigh correlation
Revenue is highly correlated with Size and 6 other fieldsHigh correlation
tableau is highly correlated with sql and 2 other fieldsHigh correlation
bi is highly correlated with tableauHigh correlation
Degree is highly correlated with Industry and 1 other fieldsHigh correlation
Salary Estimate is uniformly distributed Uniform
Job Description is uniformly distributed Uniform
df_index has unique values Unique

Reproduction

Analysis started2022-10-02 04:33:28.768715
Analysis finished2022-10-02 04:34:06.505287
Duration37.74 seconds
Software versionpandas-profiling v3.3.0
Download configurationconfig.json

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct742
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean469.1293801
Minimum0
Maximum955
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2022-10-02T10:04:06.749994image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile38.05
Q1221.5
median472.5
Q3707.75
95-th percentile908.9
Maximum955
Range955
Interquartile range (IQR)486.25

Descriptive statistics

Standard deviation279.7931171
Coefficient of variation (CV)0.5964092828
Kurtosis-1.215840104
Mean469.1293801
Median Absolute Deviation (MAD)244
Skewness0.004952491596
Sum348094
Variance78284.18837
MonotonicityStrictly increasing
2022-10-02T10:04:07.054185image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01
 
0.1%
6381
 
0.1%
6291
 
0.1%
6301
 
0.1%
6311
 
0.1%
6321
 
0.1%
6331
 
0.1%
6341
 
0.1%
6351
 
0.1%
6361
 
0.1%
Other values (732)732
98.7%
ValueCountFrequency (%)
01
0.1%
11
0.1%
21
0.1%
31
0.1%
41
0.1%
51
0.1%
61
0.1%
71
0.1%
81
0.1%
91
0.1%
ValueCountFrequency (%)
9551
0.1%
9531
0.1%
9521
0.1%
9511
0.1%
9501
0.1%
9491
0.1%
9481
0.1%
9471
0.1%
9461
0.1%
9451
0.1%

Job Title
Categorical

HIGH CARDINALITY

Distinct264
Distinct (%)35.6%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Data Scientist
131 
Data Engineer
53 
Senior Data Scientist
 
34
Data Analyst
 
15
Senior Data Engineer
 
14
Other values (259)
495 

Length

Max length98
Median length66
Mean length27.94204852
Min length9

Characters and Unicode

Total characters20733
Distinct characters66
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)14.3%

Sample

1st rowData Scientist
2nd rowHealthcare Data Scientist
3rd rowData Scientist
4th rowData Scientist
5th rowData Scientist

Common Values

ValueCountFrequency (%)
Data Scientist131
 
17.7%
Data Engineer53
 
7.1%
Senior Data Scientist34
 
4.6%
Data Analyst15
 
2.0%
Senior Data Engineer14
 
1.9%
Senior Data Analyst12
 
1.6%
Lead Data Scientist8
 
1.1%
Marketing Data Analyst6
 
0.8%
Sr. Data Engineer6
 
0.8%
Machine Learning Engineer5
 
0.7%
Other values (254)458
61.7%

Length

2022-10-02T10:04:07.386831image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
data567
20.1%
scientist420
 
14.9%
173
 
6.1%
engineer160
 
5.7%
senior124
 
4.4%
analyst102
 
3.6%
sr48
 
1.7%
analytics38
 
1.3%
science36
 
1.3%
associate33
 
1.2%
Other values (313)1122
39.7%

Most occurring characters

ValueCountFrequency (%)
t2114
 
10.2%
2081
 
10.0%
a2004
 
9.7%
i1852
 
8.9%
e1744
 
8.4%
n1605
 
7.7%
c940
 
4.5%
s938
 
4.5%
r857
 
4.1%
S799
 
3.9%
Other values (56)5799
28.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter14912
71.9%
Uppercase Letter3194
 
15.4%
Space Separator2081
 
10.0%
Other Punctuation228
 
1.1%
Dash Punctuation193
 
0.9%
Open Punctuation42
 
0.2%
Close Punctuation41
 
0.2%
Decimal Number41
 
0.2%
Math Symbol1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t2114
14.2%
a2004
13.4%
i1852
12.4%
e1744
11.7%
n1605
10.8%
c940
6.3%
s938
6.3%
r857
5.7%
o641
 
4.3%
l532
 
3.6%
Other values (15)1685
11.3%
Uppercase Letter
ValueCountFrequency (%)
S799
25.0%
D661
20.7%
A290
 
9.1%
E261
 
8.2%
M151
 
4.7%
I150
 
4.7%
C136
 
4.3%
L118
 
3.7%
P111
 
3.5%
R102
 
3.2%
Other values (14)415
13.0%
Decimal Number
ValueCountFrequency (%)
214
34.1%
012
29.3%
18
19.5%
43
 
7.3%
52
 
4.9%
92
 
4.9%
Other Punctuation
ValueCountFrequency (%)
,103
45.2%
/62
27.2%
.35
 
15.4%
&26
 
11.4%
:2
 
0.9%
Dash Punctuation
ValueCountFrequency (%)
-184
95.3%
9
 
4.7%
Space Separator
ValueCountFrequency (%)
2081
100.0%
Open Punctuation
ValueCountFrequency (%)
(42
100.0%
Close Punctuation
ValueCountFrequency (%)
)41
100.0%
Math Symbol
ValueCountFrequency (%)
|1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin18106
87.3%
Common2627
 
12.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
t2114
11.7%
a2004
11.1%
i1852
10.2%
e1744
 
9.6%
n1605
 
8.9%
c940
 
5.2%
s938
 
5.2%
r857
 
4.7%
S799
 
4.4%
D661
 
3.7%
Other values (39)4592
25.4%
Common
ValueCountFrequency (%)
2081
79.2%
-184
 
7.0%
,103
 
3.9%
/62
 
2.4%
(42
 
1.6%
)41
 
1.6%
.35
 
1.3%
&26
 
1.0%
214
 
0.5%
012
 
0.5%
Other values (7)27
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII20724
> 99.9%
Punctuation9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t2114
 
10.2%
2081
 
10.0%
a2004
 
9.7%
i1852
 
8.9%
e1744
 
8.4%
n1605
 
7.7%
c940
 
4.5%
s938
 
4.5%
r857
 
4.1%
S799
 
3.9%
Other values (55)5790
27.9%
Punctuation
ValueCountFrequency (%)
9
100.0%

Salary Estimate
Categorical

HIGH CARDINALITY
UNIFORM

Distinct416
Distinct (%)56.1%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
$49K-$113K (Glassdoor est.)
 
6
$86K-$143K (Glassdoor est.)
 
6
$54K-$115K (Glassdoor est.)
 
6
$21-$34 Per Hour(Glassdoor est.)
 
6
$74K-$124K (Glassdoor est.)
 
5
Other values (411)
713 

Length

Max length41
Median length27
Mean length27.26684636
Min length24

Characters and Unicode

Total characters20232
Distinct characters37
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)26.4%

Sample

1st row$53K-$91K (Glassdoor est.)
2nd row$63K-$112K (Glassdoor est.)
3rd row$80K-$90K (Glassdoor est.)
4th row$56K-$97K (Glassdoor est.)
5th row$86K-$143K (Glassdoor est.)

Common Values

ValueCountFrequency (%)
$49K-$113K (Glassdoor est.)6
 
0.8%
$86K-$143K (Glassdoor est.)6
 
0.8%
$54K-$115K (Glassdoor est.)6
 
0.8%
$21-$34 Per Hour(Glassdoor est.)6
 
0.8%
$74K-$124K (Glassdoor est.)5
 
0.7%
$76K-$142K (Glassdoor est.)5
 
0.7%
$107K-$173K (Glassdoor est.)5
 
0.7%
$81K-$167K (Glassdoor est.)5
 
0.7%
$68K-$139K (Glassdoor est.)4
 
0.5%
$63K-$105K (Glassdoor est.)4
 
0.5%
Other values (406)690
93.0%

Length

2022-10-02T10:04:07.892715image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
est725
32.4%
glassdoor692
30.9%
per24
 
1.1%
hour(glassdoor21
 
0.9%
provided17
 
0.8%
employer17
 
0.8%
49k-$113k6
 
0.3%
86k-$143k6
 
0.3%
54k-$115k6
 
0.3%
21-$346
 
0.3%
Other values (413)721
32.2%

Most occurring characters

ValueCountFrequency (%)
s2151
 
10.6%
1499
 
7.4%
o1496
 
7.4%
$1484
 
7.3%
K1436
 
7.1%
1919
 
4.5%
r824
 
4.1%
e795
 
3.9%
l759
 
3.8%
a747
 
3.7%
Other values (27)8122
40.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter8406
41.5%
Decimal Number3649
18.0%
Uppercase Letter2260
 
11.2%
Space Separator1499
 
7.4%
Currency Symbol1484
 
7.3%
Dash Punctuation742
 
3.7%
Other Punctuation742
 
3.7%
Close Punctuation725
 
3.6%
Open Punctuation725
 
3.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s2151
25.6%
o1496
17.8%
r824
 
9.8%
e795
 
9.5%
l759
 
9.0%
a747
 
8.9%
d747
 
8.9%
t725
 
8.6%
y46
 
0.5%
p29
 
0.3%
Other values (4)87
 
1.0%
Decimal Number
ValueCountFrequency (%)
1919
25.2%
2356
 
9.8%
0328
 
9.0%
6317
 
8.7%
4304
 
8.3%
5293
 
8.0%
9285
 
7.8%
8285
 
7.8%
3283
 
7.8%
7279
 
7.6%
Uppercase Letter
ValueCountFrequency (%)
K1436
63.5%
G713
31.5%
P41
 
1.8%
E29
 
1.3%
H24
 
1.1%
S17
 
0.8%
Other Punctuation
ValueCountFrequency (%)
.725
97.7%
:17
 
2.3%
Space Separator
ValueCountFrequency (%)
1499
100.0%
Currency Symbol
ValueCountFrequency (%)
$1484
100.0%
Dash Punctuation
ValueCountFrequency (%)
-742
100.0%
Close Punctuation
ValueCountFrequency (%)
)725
100.0%
Open Punctuation
ValueCountFrequency (%)
(725
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin10666
52.7%
Common9566
47.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
s2151
20.2%
o1496
14.0%
K1436
13.5%
r824
 
7.7%
e795
 
7.5%
l759
 
7.1%
a747
 
7.0%
d747
 
7.0%
t725
 
6.8%
G713
 
6.7%
Other values (10)273
 
2.6%
Common
ValueCountFrequency (%)
1499
15.7%
$1484
15.5%
1919
9.6%
-742
7.8%
)725
7.6%
.725
7.6%
(725
7.6%
2356
 
3.7%
0328
 
3.4%
6317
 
3.3%
Other values (7)1746
18.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII20232
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s2151
 
10.6%
1499
 
7.4%
o1496
 
7.4%
$1484
 
7.3%
K1436
 
7.1%
1919
 
4.5%
r824
 
4.1%
e795
 
3.9%
l759
 
3.8%
a747
 
3.7%
Other values (27)8122
40.1%

Job Description
Categorical

HIGH CARDINALITY
UNIFORM

Distinct463
Distinct (%)62.4%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Description Medical Laboratory Scientist - Texas Health Huguley- operated as joint venture between Texas Health Resources and AdventHealth Location Address: 11801 South Fwy., Burleson, TX 76028 Top Reasons to Work At Texas Health Huguley, Burleson, TX Our care for patients extend to the spiritual level by praying with patients and families and providing on call, 24 hours, 7 days a week Chaplains for spiritual support. Award winning facility and departments including Great Place to Work by Beckers Hospital Review and Gallup. Work with the latest technology and top experts including Daisy Award recipients while on the way to Magnet status2020. Amazing medical benefits through Aetna plus an onsite full-service fitness center. Growth opportunities designed for each employee. Located about 10 minutes from downtown Fort Worth and near TCU in the award-winning school district, Burleson ISD which also provides a low-cost of living. Work Hours/Shift: Full Time 3rd Shift You Will Be Responsible For: Accurately performs and expeditiously reports laboratory tests, according to departmental policies, CLIA law and regulatory standards. Determines identity and suitability of specimens when received, according to procedure policies. Assures test accuracy by performing and recording control testing, in accordance with departmental policy on each shift, as observed by the department supervisor. Performs tests in accordance with department policy and reports results in a timely manner, as documented in test records. Organizes daily work efficiently and expedites testing as observed by supervisor. Investigates problems and takes initiative in resolving them, both technical and non-technical, and communicates the outcome to the appropriate individual. Reviews integrity of report by checking legibility, completeness and credibility of results prior to releasing, as observed by supervisor. Accurately performs, records, transcribes or reviews proficiency testing material by the stated deadline, according to laboratory policy and CLIA law, as evidenced in the survey evaluation results. Maintains instrumentation and department supplies in order to ensure efficient departmental operations. Performs and documents assigned equipment maintenance duties per shift according to maintenance manuals. Restocks department supplies on a regular basis to ensure adequacy of inventory, as documented on department checklist. Recognizes problems in instrumentation, performs first line repairs as outlined in procedure manual and notifies appropriate individual if unable to adequately alleviate the problem. Promotes and contributes positively to intradepartmental and interdepartmental communications to ensure efficient departmental operation. Answers telephone and pages promptly and in a courteous manner, identifying self and department at all times. Assists in orientation and training of new associates unfamiliar with the department, as observed by the supervisor. Relays all appropriate information during shift hand-off, and ensures department is covered prior to leaving. Demonstrates good judgment in directing phone calls or questions to the appropriate department or individual. Promotes a safe working environment by regimented clean up and adherence to safety manuals. Adheres to established departmental guidelines for clean up on each shift. Protects self and co-workers by practicing safety precautions as established in safety and infection control manuals. Maintains appropriate departmental records and filing systems to ensure the expeditious retrieval of information and to comply with regulatory requirements. Ensures that results are properly filed and that data is recorded in a legible manner as evidenced in departmental records. Exhibits ability to perform essential computer operations pertaining to job duties. Performs outpatient ordering as defined in the department policy. Maintains a professional attitude in the work place regarding procedures, personnel and continuing education. Demonstrates a willingness to learn new procedures and instrumentation by attending training sessions and becoming familiar with new revisions as they occur. Reviews policies frequently, assists in updating contents and complies with procedures contained therein. Demonstrates flexibility by being able to work various departments and alternative shifts when requested. Exhibits willingness to help associates in their tasks when able. Accepts constructive criticism and feedback. Maintains necessary continuing education credits, keeps required competencies current, and maintains valid ASCP Certification. Qualifications What You Will Need: Must have a B.S. degree in Medical Technology, Medical Laboratory Science or a related science. Valid ASCP or AMT Technologist/Scientist Certification. At least one year of experience is preferred. Job Summary: Perform and report clinical laboratory analysis to assist physicians and other hospital staff in the diagnosis and monitoring of patients. This facility is an equal opportunity employer and complies with federal, state and local anti-discrimination laws, regulations and ordinances.
 
4
Palermo Villa Inc. is interested in a high-energy, poised and confident individual to assist in the development of concepts, products and optimization projects through Palermo's vigorous consumer-driven R&D process. The position will apply scientific and culinary principles in research and development. Develops the understanding of and ability to translate food trends into innovative opportunities, stimulate new food ideas and product concepts. Identify, evaluate and develop potential new product development opportunities. From bench-top samples to commercialized products and finished product specifications Assist in food product formulation from bench top to commercialization using a continuously developing skill set in food formulation and processing equipment capability understanding. Applies an analytical approach to the solution of a wide variety of problems and assimilates the details and significance of various scientific analyses, procedures, and tests Demonstrates initiative, creativity and thoroughness in the execution of complex projects Plans and conducts independent research projects and participates in the development of project objectives Contributes to the development of project strategies and recommends technical direction to management Evaluates technical trends in their specific area of expertise or assignment and makes recommendations for process or product improvements and identify opportunities for new or improved process or products Organize and direct sample development for sales presentations, consumer testing and food safety assurance Maintains written technical documentation and product and process specifications as pertaining to R&D Utilizes or directs internal (manufacturing, engineering, marketing, quality systems, procurement) and external (suppliers, consultants) functional experts to resolve issues. Assist in PR events, food shows and Sales presentations on key customer calls Provide technical support/serves as product development contact for Sales, Customer and Operations To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The position requires 5+ years' experience developing products within the food industry. Strong interpersonal and communication skills Ability to effectively present information to top management, public groups, and/or boards of directors. Ability to apply mathematical operations to such tasks as frequency distribution, determination of test reliability and validity, analysis of variance, correlation techniques, sampling theory, and factor analysis. Ability to define problems, collect data, establish facts, and draw valid conclusions. Strong computer skills are necessary Educational Requirements: Bachelor's Degree in Food Science, Biology, Chemistry, Culinary or equivalent
 
4
Under direct supervision of the Director of Database Marketing, the Marketing Data Analyst will work closely with members of the database marketing team and the FP&A marketing analysis team to derive insights from large amounts of customer and transactional data to develop segmentations, strategies, visualizations, reports, and recommendations for various marketing purposes. The Marketing Data Analyst will assist management with the interpretation, evaluation and interrelationship of data and generate integrated business analysis and projections to facilitate decision making. Essential Duties & Responsibilities Develop queries in SAS that create marketing campaigns to optimize profit and produce multi-channel campaign outputs. Design and evaluate various tests and optimizations of campaigns. Monitor the quality of all data at both the project and output level for the Database Marketing team. Support the integration of new data sources and analyze and confirm the overall quality and integrity of source data. Generate database extracts for Database Marketing teams as needed. Provide campaign analytics to extended team, including insights and recommendations to improve message effectiveness and campaign performance. Build business intelligence, reports and dashboards using software like SAS, Microsoft Excel/VBA, Tableau, or SAS Visual Analytics that include segment/campaign profitability and customer behavior or trends. Create relationships with internal stakeholders to discover how data, platform and tools can assist to execute business needs. Identify new business opportunities or potential risks based on data analysis on subject matters of various operations departments. Train users as needed. Perform other duties as assigned to support the efficient operation of the department. Education/Experience/Qualifications Bachelors or Masters Degree in Computer Science, Economics, Marketing, Finance, Mathematics, or related field required. 2+ years of experience with SAS and/or SQL and analyzing large datasets. Equivalent combination of education and progressive, relevant and direct experience may be considered in lieu of minimum educational/experience requirements indicated above. Advanced proficiency in Microsoft Excel and Word. Experience working with relational databases is required. Experience in programming/scripting. Experience with data visualization, reporting & dash boarding tools such as SAS visual Analytics or Tableau. Experience with Google Analytics custom reports and dashboards preferred. Familiarity with marketing methodologies and systems such as segmentation modeling, targeting, CRM, and ROI projections and evaluation. Predictive Modeling experience preferred. Employee must have experience demonstrating the utmost discretion and confidentiality as they will have access to confidential information including, but not limited to: customer contact information, customer financial data, and organizational financial data. Excellent communication skills, both written and verbal. Must be able to obtain/maintain any necessary certifications and/or licenses. Ability to mentor coordinators and administrative staff. Ability to work with mathematical concepts such as probability and statistical inference. Ability to apply concepts such as fractions, percentages, ratios, and proportions to practical situations, including the development of financial statistical models and forecasts. Ability to define problems, collect data, establish facts, and draw valid conclusions with minimal direction. Ability to interpret an extensive variety of technical instructions in mathematical or diagram form and deal with several abstract and concrete variables. Ability to effectively present information to, and respond to questions from, groups of managers and directors. Ability to read, analyze, and interpret general business periodicals, professional journals, technical procedures, governmental regulations, financial reports, and legal documents. Ability to respond to common inquiries or complaints from customers, regulatory agencies, or members of the business community. Certificates/Licenses/Registrations At the discretion of the San Manuel Tribal Gaming Commission you may be required to obtain and maintain a gaming license. San Manuel Band of Mission Indians and San Manuel Casino will make reasonable accommodations in compliance with the Americans with Disabilities Act of 1990. As one of the largest private employers in the Inland Empire, San Manuel deeply cares about the future, growth and well-being of its employees. Join our team today!
 
4
Responsibilities Include but may not be limited to: performing various tasks assisting in development of new items, renovation of existing formulations, and supports efforts to ensure quality product is produced, maintained, and documented. Additionally, this position will underwrite efforts in product development and distribution by our sales and prourement teams. This position will also be responsible for maintaining and entering data in several databases. As a member of the R&D Team, you will help develop products which can be reproduced in a large-scale food manufacturing environment. Assist in the development of new bean products from concept approval, formulation, product development, plant trial runs to launch and post-launch review by collaborating with Marketing , Sales, Project management, QA, and Production. Participate as an active member of cross-functional business development teams comprosed of individuals from a variety of desciplines, includjing Marketing, Finance, Purchasing and many others. Assist in redesign & renovation of existing products to increase quality, reduce costs, and/or increase production efficiencies. Partner internally and extenally to source new ingredients and leverage vendor expertise in ingredient functionality. Assist Quality and Procurement departments in maintaining specifications for new ingredients and/or suppliers. Supoport production with troubleshooting out of spec product or production concerns on established products. Maintain accurate product records, documentation and archives in various databases including global data synchronization of existing retail business. Maintain laboratory, including upkeep of equipment, stocking of supplies, and general cleaning of work areas. Performs other related and assigned duties as necessary. Minimun Qualifications Must hold a Bachelors degree in Food Science from an accredited University. Previous experience in food product development & food manufacturing strongly preferred. Ability and interest to work in laboratory, pilot plant and manufacturing scale environments. Proven ability to manage multiple assignments/tasks. Ability to work independently while collaborating and communicating with team members in various departments. Strong communication skills (oral and written). Knowlege of Genesis labeling system preferred but not required. Must be physically capable of lifting 50lbs. weight restriction.
 
4
What We Do: At the SEI Emerging Technology Center, we describe our work as “making the recently possible mission-practical.” We help our government customers stay at the leading edge of technology by identifying, demonstrating, extending, and applying emerging software technologies to solve real government problems. We currently work in the fields of human-machine interaction, applied artificial intelligence and machine learning, and advanced computing—areas that are changing and progressing rapidly. As we show our customers how new technologies can improve their mission capabilities through rapid prototyping and iterative development, we both rely on and shape academic and industrial research. Are you creative, curious, energetic, collaborative, technology-focused, and hard-working? Are you interested in making a difference by bringing innovation to government organizations and beyond? Apply to join our team. Position Summary: As a senior research scientist focusing on machine learning, you will identify, shape, apply, conduct, and lead research that matches critical U.S. government needs. Requirements: BS in Computer Science or related discipline with ten (10) years of experience; OR MS in the same fields with eight (8) years of experience; OR PhD with five (5) years of experience. Flexible to travel to other SEI offices in Pittsburgh and Washington, DC, sponsor sites, conferences, and offsite meetings on occasion. Moderate (25") travel outside of your home location. You will be subject to a background investigation and must be eligible to obtain and maintain a Department of Defense security clearance. Duties: Hands-on research: You’ll conduct and lead novel research in applied machine learning and artificial intelligence. Solution development: You’ll work with and lead interdisciplinary teams to turn research results into prototype operational capabilities for government customers and stakeholders. Strategy: You’ll work with Center leaders and colleagues to plan, develop, and carry out an overall research strategy, and to influence the national research agenda regarding future technology. Collaboration: You'll actively participate on teams of software developers, researchers, designers, and technical leads. You'll build relationships and collaborate with researchers, government customers, and other stakeholders to understand challenges, needs, possible solutions, and research directions. Mentoring: You'll contribute to improving the overall technical capabilities of the Center by mentoring and teaching others, participating in design (software and otherwise) sessions, and sharing insights and wisdom across the SEI Emerging Technology Center team. Knowledge, Skills, and Abilities: Deep technical knowledge: You have performed extensive research in applied machine learning and artificial intelligence. You have worked with tools, techniques, algorithms, software, and programming languages for deep learning, reinforcement learning, statistics, sensors and sensor fusion, planning, computer vision, or related areas. Communication and Collaboration: You have strong written and verbal communication skills and can interact collaboratively and diplomatically with customers and colleagues. You grasp the big picture, direction, and goals of an effort while focusing great attention to detail. You can present complex ideas to people who may not have a deep understanding of the subject area. Dedication: You can meet deadlines while multi-tasking–sometimes under pressure and with shifting priorities. Creativity and Innovation: You are creative and curious, and you are inspired by the prospect of collaborating with premier researchers and visionaries at Carnegie Mellon and other universities and organizations. You quickly learn new procedures, techniques, and approaches. You are forward-looking and can connect research with practical challenges. Knowledge and Learning: You possess broad technical interests along with a deep knowledge of a particular field such as human-computer interaction, data analytics and machine learning, advanced computing, and autonomy and adaptive systems. Desired Experience: Research practices and publications: You have a track record of conducting research in machine learning and artificial intelligence. You have a reputation for the highest level of research and technical integrity. You have demonstrated contributions and have published research. Familiarity with emerging trends and opportunities: You are familiar with technical challenges and emerging trends in computing and information science, and you are aware of opportunities in industry and government. Technical leadership: You have led research projects and have experience collaborating across research teams and mentoring other researchers. Proposals: You have formulated and delivered successful research proposals to funding agencies and led the resulting projects. Government projects: You have worked or are familiar with DARPA, IARPA, Service Labs, or other government research sponsors More Information Please visit “Why Carnegie Mellon” to learn more about becoming part of an institution inspiring innovations that change the world. A listing of employee benefits is available at: www.cmu.edu/jobs/benefits-at-a-glance/. Carnegie Mellon University is an Equal Opportunity Employer/Disability/Veteran.
 
4
Other values (458)
722 

Length

Max length10051
Median length4328
Mean length3869.545822
Min length407

Characters and Unicode

Total characters2871203
Distinct characters118
Distinct categories19 ?
Distinct scripts3 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)32.1%

Sample

1st rowData Scientist Location: Albuquerque, NM Education Required: Bachelor’s degree required, preferably in math, engineering, business, or the sciences. Skills Required: Bachelor’s Degree in relevant field, e.g., math, data analysis, database, computer science, Artificial Intelligence (AI); three years’ experience credit for Master’s degree; five years’ experience credit for a Ph.D Applicant should be proficient in the use of Power BI, Tableau, Python, MATLAB, Microsoft Word, PowerPoint, Excel, and working knowledge of MS Access, LMS, SAS, data visualization tools, and have a strong algorithmic aptitude Excellent verbal and written communication skills, and quantitative analytical skills are required Applicant must be able to work in a team environment U.S. citizenship and ability to obtain a DoD Secret Clearance required Responsibilities: The applicant will be responsible for formulating analytical solutions to complex data problems; creating data analytic models to improve data metrics; analyzing customer behavior and trends; delivering insights to stakeholders, as well as designing and crafting reports, dashboards, models, and algorithms to make data insights actionable; selecting features, building and optimizing classifiers using machine learning techniques; data mining using state-of-the-art methods, extending organization’s data with third party sources of information when needed; enhancing data collection procedures to include information that is relevant for building analytic systems; processing, cleansing, and verifying the integrity of data used for analysis; doing ad-hoc analysis and presenting results in a clear manner; and creating automated anomaly detection systems and constant tracking of its performance. Benefits: We offer competitive salaries commensurate with education and experience. We have an excellent benefits package that includes: Comprehensive health, dental, life, long and short term disability insurance 100% Company funded Retirement Plans Generous vacation, holiday and sick pay plans Tuition assistance Benefits are provided to employees regularly working a minimum of 30 hours per week. Tecolote Research is a private, employee-owned corporation where people are our primary resource. Our investments in technology and training give our employees the tools to ensure our clients are provided the solutions they need, and our very high employee retention rate and stable workforce is an added value to our customers. Apply now to connect with a company that invests in you.
2nd rowWhat You Will Do: I. General Summary The Healthcare Data Scientist position will join our Advanced Analytics group at the University of Maryland Medical System (UMMS) in support of its strategic priority to become a data-driven and outcomes-oriented organization. The successful candidate will have 3+ years of experience with Machine Learning, Predictive Modeling, Statistical Analysis, Mathematical Optimization, Algorithm Development and a passion for working with healthcare data. Previous experience with various computational approaches along with an ability to demonstrate a portfolio of relevant prior projects is essential. This position will report to the UMMS Vice President for Enterprise Data and Analytics (ED&A). II. Principal Responsibilities and Tasks • Develops predictive and prescriptive analytic models in support of the organization’s clinical, operations and business initiatives and priorities. • Deploys solutions so that they provide actionable insights to the organization and are embedded or integrated with application systems • Supports and drives analytic efforts designed around organization’s strategic priorities and clinical/business problems • Works in a team to drive disruptive innovation, which may translate into improved quality of care, clinical outcomes, reduced costs, temporal efficiencies and process improvements. • Builds and extends our analytics portfolio supported by robust documentation • Works with autonomy to find solutions to complex problems using open source tools and in-house development • Stays abreast of state-of-the-art literature in the fields of operations research, statistical modeling, statistical process control and mathematical optimization • Creates, communicates, and manages the project plans and other required project documentation and provides updates to leadership as necessary • Develops and maintains relationships with business, IT and clinical leaders and stakeholders across the enterprise to facilitate collaboration and effective communication • Works with the analytics team and clinical/business stakeholders to develop pilots so that they may be tested and validated in pilot settings • Performs analysis to evaluate primary and secondary objectives from such pilots • Assists leadership with strategies for scaling successful projects across the organization and enhances the analytics applications based on feedback from end-users and clinical/business consumers • Assists leadership with dissemination of success stories (and failures) in an effort to increase analytics literacy and adoption across the organization. What You Need to Be Successful: III. Education and Experience • Master’s or higher degree (may be substituted by relevant work experience) in applied mathematics, physics, computer science, engineering, statistics or a related field • 3+ years of Mathematical Optimization, Machine Learning, Predictive Analytics and Algorithm Development experience (experience with tools such as WEKA, RapidMiner, R. Python or other open source tools strongly desired) • Strong development skills in two or more of the following: C/C++, C#, Python, Java • Combining analytic methods with advanced data visualizations • Expert ability to breakdown and clearly define problems • Experience with Natural Language Processing preferred IV. Knowledge, Skills and Abilities • Proven communications skills – Effective at working independently and in collaboration with other staff members. Capable of clearly presenting findings orally, in writing, or through graphics. • Proven analytical skills – Able to compare, contrast, and validate work with keen attention to detail. Skilled in working with “real world” data including scrubbing, transformation, and imputation. • Proven problem solving skills – Able to plan work, set clear direction, and coordinate own tasks in a fast-paced multidisciplinary environment. Expert at triaging issues, identifying data anomalies, and debugging software. • Design and prototype new application functionality for our products. • Change oriented – actively generates process improvements; supports and drives change, and confronts difficult circumstances in creative ways • Effective communicator and change agent • Ability to prioritize the tasks of the project timeline to achieve the desired results • Strong analytic and problem solving skills • Ability to cooperatively and effectively work with people from various organization levels We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.
3rd rowKnowBe4, Inc. is a high growth information security company. We are the world's largest provider of new-school security awareness training and simulated phishing. KnowBe4 was created to help organizations manage the ongoing problem of social engineering. Tens of thousands of organizations worldwide use KnowBe4's platform to mobilize their end users as a last line of defense and enable them to make better security decisions, every day. We are ranked #1 best place to work in technology nationwide by Fortune Magazine and have placed #1 or #2 in The Tampa Bay Top Workplaces Survey for the last four years. We also just had our 27th record-setting quarter in a row! The Data Scientist will work closely with the VP of FP&A and the Quantitative Analytics Manager to implement advanced analytical models and other data-driven solutions. Responsibilities: Work with key stakeholders throughout the organization to identify opportunities using financial data to develop business solutions. Develop new and enhance existing data collection procedures to ensure that all data relevant for analyses is captured. Cleanse, consolidate, and verify the integrity of data used in analyses. Build and validate predictive models to increase customer retention, revenue generation, and other business outcomes. Develop relevant statistical models to assist with profitability forecasting Create the analytics to leverage known, inferred and appended information about origins and recognizing patterns to assist in outlook forecasting Assist in the design and data modeling of data warehouse. Visualize data, especially in reports and dashboards, to communicate analysis results to stakeholders. Extend data collection to unstructured data within the company and external sources Mine and collect data (both structured and unstructured) to detect patterns, opportunities and insights that drive our organization Create and execute automation and data mining requests utilizing SQL, Access, Excel, SAS and other statistical programs Trouble shoot forecast and optimization anomalies with FP&A team through the use of statistical and mathematical optimization models. Develop testing to explain and or reduce these anomalies. Oversee and develop key metric forecasts as well as provide budget support based on trends in the business/industry. Minimum Qualifications: Master's degree in Statistics, Computer Science, Mathematics or other quantitative discipline required 2-3 years of experience in similar role (Master's Degree) 0-2 years of experience in similar role (PhD) Experience leveraging predictive modeling, big data analytics, exploratory data analysis and machine learning to drive significant business impact Experience with statistical computer languages (Python, R etc.) to manipulate and analyze large datasets preferred. Experience with data visualization tools like D3.js, matplotlib, etc., preferred Exceptional understanding of machine learning algorithms such as Random Forest, SVM, k-NN, Naïve Bayes, Gradient Boosting a plus. Applied statistical skills including statistical testing, regression, etc. Experience with data bases, query languages, and associated data architecture. Experience with distributed computing tools (Hive, Spark, etc.) is a plus. Strong analytical skills and ability to meet project deadlines. Note: An applicant assessment, background check and drug test may be part of your hiring procedure. No recruitment agencies, please.
4th row*Organization and Job ID** Job ID: 310709 Directorate: Earth & Biological Sciences Division: Biological Sciences Group: Exposure Science Team *Job Description** The Biological System Science (BSS) Group in the Biological Sciences Division of the Pacific Northwest National Laboratory (PNNL) is seeking a staff scientist with multidisciplinary experience in computational chemistry, cheminformatics, advanced statistics and/or machine learning/deep learning/AI. Preferred candidates will have a broad understanding of the state of computational metabolomics and experience in designing and implementing novel deep learning networks for chemistry applications. Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification is also highly valued. Successful candidates will join a large, uniquely collaborative, collegial group of innovators driving the integration of data science, computational science and analytical chemistry to solve the nations most challenging problems in human health, chemical forensics, and national security. The BSS Group is diverse and inclusive, working closely with colleagues across the laboratory with expertise in computational biology, integrative omics, applied mathematics, computer science, and statistics. + Apply knowledge of statistics, machine learning, advanced mathematics, simulation, software development, and data modeling to to design, development and implement methods that integrate, clean and analyze data, recognize patterns, address uncertainty, pose questions, and make discoveries from structured and/or unstructured data. + Produce solutions driven by exploratory data analysis from complex and high-dimensional datasets. + Design, develop, and evaluate predictive models and advanced algorithms that lead to optimal value extraction from data. + Develop and maintain existing deep learning networks that generate novel molecules for drug discovery applications + Contribue or author proposals, peer-reviewed papers, and other technical products. *Minimum Qualifications** BS/BA with 0-1 years of experience or MS/MA with 0-1 years of experience *Preferred Qualifications** + MS in chemical engineering, computer science, or related field with a GPA of 3.5+ 5+ years of research experience + Intermediate level programming experience (preferably Python) and high-performance computing experience + At least one first author published, or proof of submitted, paper applying deep learning for use in novel compound generation + Understanding of the NMDA receptor and potential drug targets + Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification *Equal Employment Opportunity** Battelle Memorial Institute (BMI) at Pacific Northwest National Laboratory (PNNL) is an Affirmative Action/Equal Opportunity Employer and supports diversity in the workplace. All employment decisions are made without regard to race, color, religion, sex, national origin, age, disability, veteran status, marital or family status, sexual orientation, gender identity, or genetic information. All BMI staff must be able to demonstrate the legal right to work in the United States. BMI is an E-Verify employer. Learn more at jobs.pnnl.gov. *_Please be aware that the Department of Energy (DOE) prohibits DOE employees and contractors from participation in certain foreign government talent recruitment programs. If you are offered a position at PNNL and are currently a participant in a foreign government talent recruitment program you will be required to disclose this information before your first day of employment._** _Directorate:_ _Earth & Biological Sciences_ _Job Category:_ _Scientists/Scientific Support_ _Group:_ _Biological Systems Science_ _Opening Date:_ _2020-03-26_ _Closing Date:_ _2020-04-05_
5th rowData Scientist Affinity Solutions / Marketing Cloud seeks smart, curious, technically savvy candidates to join our cutting-edge data science team. We hire the best and brightest and give them the opportunity to work on industry-leading technologies. The data sciences team at AFS/Marketing Cloud build models, machine learning algorithms that power all our ad-tech/mar-tech products at scale, develop methodology and tools to precisely and effectively measure market campaign effects, and research in-house and public data sources for consumer spend behavior insights. In this role, you'll have the opportunity to come up with new ideas and solutions that will lead to improvement of our ability to target the right audience, derive insights and provide better measurement methodology for marketing campaigns. You'll access our core data asset and machine learning infrastructure to power your ideas. Duties and Responsibilities · Support all clients model building needs, including maintaining and improving current modeling/scoring methodology and processes, · Provide innovative solutions to customized modeling/scoring/targeting with appropriate ML/statistical tools, · Provide analytical/statistical support such as marketing test design, projection, campaign measurement, market insights to clients and stakeholders. · Mine large consumer datasets in the cloud environment to support ad hoc business and statistical analysis, · Develop and Improve automation capabilities to enable customized delivery of the analytical products to clients, · Communicate the methodologies and the results to the management, clients and none technical stakeholders. Basic Qualifications · Advanced degree in Statistics/Mathematics/Computer Science/Economics or other fields that requires advanced training in data analytics. · Being able to apply basic statistical/ML concepts and reasoning to address and solve business problems such as targeting, test design, KPI projection and performance measurement. · Entrepreneurial, highly self-motivated, collaborative, keen attention to detail, willingness and capable learn quickly, and ability to effectively prioritize and execute tasks in a high pressure environment. · Being flexible to accept different task assignments and able to work on a tight time schedule. · Excellent command of one or more programming languages; preferably Python, SAS or R · Familiar with one of the database technologies such as PostgreSQL, MySQL, can write basic SQL queries · Great communication skills (verbal, written and presentation) Preferred Qualifications · Experience or exposure to large consumer and/or demographic data sets. · Familiarity with data manipulation and cleaning routines and techniques.

Common Values

ValueCountFrequency (%)
Description Medical Laboratory Scientist - Texas Health Huguley- operated as joint venture between Texas Health Resources and AdventHealth Location Address: 11801 South Fwy., Burleson, TX 76028 Top Reasons to Work At Texas Health Huguley, Burleson, TX Our care for patients extend to the spiritual level by praying with patients and families and providing on call, 24 hours, 7 days a week Chaplains for spiritual support. Award winning facility and departments including Great Place to Work by Beckers Hospital Review and Gallup. Work with the latest technology and top experts including Daisy Award recipients while on the way to Magnet status2020. Amazing medical benefits through Aetna plus an onsite full-service fitness center. Growth opportunities designed for each employee. Located about 10 minutes from downtown Fort Worth and near TCU in the award-winning school district, Burleson ISD which also provides a low-cost of living. Work Hours/Shift: Full Time 3rd Shift You Will Be Responsible For: Accurately performs and expeditiously reports laboratory tests, according to departmental policies, CLIA law and regulatory standards. Determines identity and suitability of specimens when received, according to procedure policies. Assures test accuracy by performing and recording control testing, in accordance with departmental policy on each shift, as observed by the department supervisor. Performs tests in accordance with department policy and reports results in a timely manner, as documented in test records. Organizes daily work efficiently and expedites testing as observed by supervisor. Investigates problems and takes initiative in resolving them, both technical and non-technical, and communicates the outcome to the appropriate individual. Reviews integrity of report by checking legibility, completeness and credibility of results prior to releasing, as observed by supervisor. Accurately performs, records, transcribes or reviews proficiency testing material by the stated deadline, according to laboratory policy and CLIA law, as evidenced in the survey evaluation results. Maintains instrumentation and department supplies in order to ensure efficient departmental operations. Performs and documents assigned equipment maintenance duties per shift according to maintenance manuals. Restocks department supplies on a regular basis to ensure adequacy of inventory, as documented on department checklist. Recognizes problems in instrumentation, performs first line repairs as outlined in procedure manual and notifies appropriate individual if unable to adequately alleviate the problem. Promotes and contributes positively to intradepartmental and interdepartmental communications to ensure efficient departmental operation. Answers telephone and pages promptly and in a courteous manner, identifying self and department at all times. Assists in orientation and training of new associates unfamiliar with the department, as observed by the supervisor. Relays all appropriate information during shift hand-off, and ensures department is covered prior to leaving. Demonstrates good judgment in directing phone calls or questions to the appropriate department or individual. Promotes a safe working environment by regimented clean up and adherence to safety manuals. Adheres to established departmental guidelines for clean up on each shift. Protects self and co-workers by practicing safety precautions as established in safety and infection control manuals. Maintains appropriate departmental records and filing systems to ensure the expeditious retrieval of information and to comply with regulatory requirements. Ensures that results are properly filed and that data is recorded in a legible manner as evidenced in departmental records. Exhibits ability to perform essential computer operations pertaining to job duties. Performs outpatient ordering as defined in the department policy. Maintains a professional attitude in the work place regarding procedures, personnel and continuing education. Demonstrates a willingness to learn new procedures and instrumentation by attending training sessions and becoming familiar with new revisions as they occur. Reviews policies frequently, assists in updating contents and complies with procedures contained therein. Demonstrates flexibility by being able to work various departments and alternative shifts when requested. Exhibits willingness to help associates in their tasks when able. Accepts constructive criticism and feedback. Maintains necessary continuing education credits, keeps required competencies current, and maintains valid ASCP Certification. Qualifications What You Will Need: Must have a B.S. degree in Medical Technology, Medical Laboratory Science or a related science. Valid ASCP or AMT Technologist/Scientist Certification. At least one year of experience is preferred. Job Summary: Perform and report clinical laboratory analysis to assist physicians and other hospital staff in the diagnosis and monitoring of patients. This facility is an equal opportunity employer and complies with federal, state and local anti-discrimination laws, regulations and ordinances.4
 
0.5%
Palermo Villa Inc. is interested in a high-energy, poised and confident individual to assist in the development of concepts, products and optimization projects through Palermo's vigorous consumer-driven R&D process. The position will apply scientific and culinary principles in research and development. Develops the understanding of and ability to translate food trends into innovative opportunities, stimulate new food ideas and product concepts. Identify, evaluate and develop potential new product development opportunities. From bench-top samples to commercialized products and finished product specifications Assist in food product formulation from bench top to commercialization using a continuously developing skill set in food formulation and processing equipment capability understanding. Applies an analytical approach to the solution of a wide variety of problems and assimilates the details and significance of various scientific analyses, procedures, and tests Demonstrates initiative, creativity and thoroughness in the execution of complex projects Plans and conducts independent research projects and participates in the development of project objectives Contributes to the development of project strategies and recommends technical direction to management Evaluates technical trends in their specific area of expertise or assignment and makes recommendations for process or product improvements and identify opportunities for new or improved process or products Organize and direct sample development for sales presentations, consumer testing and food safety assurance Maintains written technical documentation and product and process specifications as pertaining to R&D Utilizes or directs internal (manufacturing, engineering, marketing, quality systems, procurement) and external (suppliers, consultants) functional experts to resolve issues. Assist in PR events, food shows and Sales presentations on key customer calls Provide technical support/serves as product development contact for Sales, Customer and Operations To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The position requires 5+ years' experience developing products within the food industry. Strong interpersonal and communication skills Ability to effectively present information to top management, public groups, and/or boards of directors. Ability to apply mathematical operations to such tasks as frequency distribution, determination of test reliability and validity, analysis of variance, correlation techniques, sampling theory, and factor analysis. Ability to define problems, collect data, establish facts, and draw valid conclusions. Strong computer skills are necessary Educational Requirements: Bachelor's Degree in Food Science, Biology, Chemistry, Culinary or equivalent4
 
0.5%
Under direct supervision of the Director of Database Marketing, the Marketing Data Analyst will work closely with members of the database marketing team and the FP&A marketing analysis team to derive insights from large amounts of customer and transactional data to develop segmentations, strategies, visualizations, reports, and recommendations for various marketing purposes. The Marketing Data Analyst will assist management with the interpretation, evaluation and interrelationship of data and generate integrated business analysis and projections to facilitate decision making. Essential Duties & Responsibilities Develop queries in SAS that create marketing campaigns to optimize profit and produce multi-channel campaign outputs. Design and evaluate various tests and optimizations of campaigns. Monitor the quality of all data at both the project and output level for the Database Marketing team. Support the integration of new data sources and analyze and confirm the overall quality and integrity of source data. Generate database extracts for Database Marketing teams as needed. Provide campaign analytics to extended team, including insights and recommendations to improve message effectiveness and campaign performance. Build business intelligence, reports and dashboards using software like SAS, Microsoft Excel/VBA, Tableau, or SAS Visual Analytics that include segment/campaign profitability and customer behavior or trends. Create relationships with internal stakeholders to discover how data, platform and tools can assist to execute business needs. Identify new business opportunities or potential risks based on data analysis on subject matters of various operations departments. Train users as needed. Perform other duties as assigned to support the efficient operation of the department. Education/Experience/Qualifications Bachelors or Masters Degree in Computer Science, Economics, Marketing, Finance, Mathematics, or related field required. 2+ years of experience with SAS and/or SQL and analyzing large datasets. Equivalent combination of education and progressive, relevant and direct experience may be considered in lieu of minimum educational/experience requirements indicated above. Advanced proficiency in Microsoft Excel and Word. Experience working with relational databases is required. Experience in programming/scripting. Experience with data visualization, reporting & dash boarding tools such as SAS visual Analytics or Tableau. Experience with Google Analytics custom reports and dashboards preferred. Familiarity with marketing methodologies and systems such as segmentation modeling, targeting, CRM, and ROI projections and evaluation. Predictive Modeling experience preferred. Employee must have experience demonstrating the utmost discretion and confidentiality as they will have access to confidential information including, but not limited to: customer contact information, customer financial data, and organizational financial data. Excellent communication skills, both written and verbal. Must be able to obtain/maintain any necessary certifications and/or licenses. Ability to mentor coordinators and administrative staff. Ability to work with mathematical concepts such as probability and statistical inference. Ability to apply concepts such as fractions, percentages, ratios, and proportions to practical situations, including the development of financial statistical models and forecasts. Ability to define problems, collect data, establish facts, and draw valid conclusions with minimal direction. Ability to interpret an extensive variety of technical instructions in mathematical or diagram form and deal with several abstract and concrete variables. Ability to effectively present information to, and respond to questions from, groups of managers and directors. Ability to read, analyze, and interpret general business periodicals, professional journals, technical procedures, governmental regulations, financial reports, and legal documents. Ability to respond to common inquiries or complaints from customers, regulatory agencies, or members of the business community. Certificates/Licenses/Registrations At the discretion of the San Manuel Tribal Gaming Commission you may be required to obtain and maintain a gaming license. San Manuel Band of Mission Indians and San Manuel Casino will make reasonable accommodations in compliance with the Americans with Disabilities Act of 1990. As one of the largest private employers in the Inland Empire, San Manuel deeply cares about the future, growth and well-being of its employees. Join our team today!4
 
0.5%
Responsibilities Include but may not be limited to: performing various tasks assisting in development of new items, renovation of existing formulations, and supports efforts to ensure quality product is produced, maintained, and documented. Additionally, this position will underwrite efforts in product development and distribution by our sales and prourement teams. This position will also be responsible for maintaining and entering data in several databases. As a member of the R&D Team, you will help develop products which can be reproduced in a large-scale food manufacturing environment. Assist in the development of new bean products from concept approval, formulation, product development, plant trial runs to launch and post-launch review by collaborating with Marketing , Sales, Project management, QA, and Production. Participate as an active member of cross-functional business development teams comprosed of individuals from a variety of desciplines, includjing Marketing, Finance, Purchasing and many others. Assist in redesign & renovation of existing products to increase quality, reduce costs, and/or increase production efficiencies. Partner internally and extenally to source new ingredients and leverage vendor expertise in ingredient functionality. Assist Quality and Procurement departments in maintaining specifications for new ingredients and/or suppliers. Supoport production with troubleshooting out of spec product or production concerns on established products. Maintain accurate product records, documentation and archives in various databases including global data synchronization of existing retail business. Maintain laboratory, including upkeep of equipment, stocking of supplies, and general cleaning of work areas. Performs other related and assigned duties as necessary. Minimun Qualifications Must hold a Bachelors degree in Food Science from an accredited University. Previous experience in food product development & food manufacturing strongly preferred. Ability and interest to work in laboratory, pilot plant and manufacturing scale environments. Proven ability to manage multiple assignments/tasks. Ability to work independently while collaborating and communicating with team members in various departments. Strong communication skills (oral and written). Knowlege of Genesis labeling system preferred but not required. Must be physically capable of lifting 50lbs. weight restriction.4
 
0.5%
What We Do: At the SEI Emerging Technology Center, we describe our work as “making the recently possible mission-practical.” We help our government customers stay at the leading edge of technology by identifying, demonstrating, extending, and applying emerging software technologies to solve real government problems. We currently work in the fields of human-machine interaction, applied artificial intelligence and machine learning, and advanced computing—areas that are changing and progressing rapidly. As we show our customers how new technologies can improve their mission capabilities through rapid prototyping and iterative development, we both rely on and shape academic and industrial research. Are you creative, curious, energetic, collaborative, technology-focused, and hard-working? Are you interested in making a difference by bringing innovation to government organizations and beyond? Apply to join our team. Position Summary: As a senior research scientist focusing on machine learning, you will identify, shape, apply, conduct, and lead research that matches critical U.S. government needs. Requirements: BS in Computer Science or related discipline with ten (10) years of experience; OR MS in the same fields with eight (8) years of experience; OR PhD with five (5) years of experience. Flexible to travel to other SEI offices in Pittsburgh and Washington, DC, sponsor sites, conferences, and offsite meetings on occasion. Moderate (25") travel outside of your home location. You will be subject to a background investigation and must be eligible to obtain and maintain a Department of Defense security clearance. Duties: Hands-on research: You’ll conduct and lead novel research in applied machine learning and artificial intelligence. Solution development: You’ll work with and lead interdisciplinary teams to turn research results into prototype operational capabilities for government customers and stakeholders. Strategy: You’ll work with Center leaders and colleagues to plan, develop, and carry out an overall research strategy, and to influence the national research agenda regarding future technology. Collaboration: You'll actively participate on teams of software developers, researchers, designers, and technical leads. You'll build relationships and collaborate with researchers, government customers, and other stakeholders to understand challenges, needs, possible solutions, and research directions. Mentoring: You'll contribute to improving the overall technical capabilities of the Center by mentoring and teaching others, participating in design (software and otherwise) sessions, and sharing insights and wisdom across the SEI Emerging Technology Center team. Knowledge, Skills, and Abilities: Deep technical knowledge: You have performed extensive research in applied machine learning and artificial intelligence. You have worked with tools, techniques, algorithms, software, and programming languages for deep learning, reinforcement learning, statistics, sensors and sensor fusion, planning, computer vision, or related areas. Communication and Collaboration: You have strong written and verbal communication skills and can interact collaboratively and diplomatically with customers and colleagues. You grasp the big picture, direction, and goals of an effort while focusing great attention to detail. You can present complex ideas to people who may not have a deep understanding of the subject area. Dedication: You can meet deadlines while multi-tasking–sometimes under pressure and with shifting priorities. Creativity and Innovation: You are creative and curious, and you are inspired by the prospect of collaborating with premier researchers and visionaries at Carnegie Mellon and other universities and organizations. You quickly learn new procedures, techniques, and approaches. You are forward-looking and can connect research with practical challenges. Knowledge and Learning: You possess broad technical interests along with a deep knowledge of a particular field such as human-computer interaction, data analytics and machine learning, advanced computing, and autonomy and adaptive systems. Desired Experience: Research practices and publications: You have a track record of conducting research in machine learning and artificial intelligence. You have a reputation for the highest level of research and technical integrity. You have demonstrated contributions and have published research. Familiarity with emerging trends and opportunities: You are familiar with technical challenges and emerging trends in computing and information science, and you are aware of opportunities in industry and government. Technical leadership: You have led research projects and have experience collaborating across research teams and mentoring other researchers. Proposals: You have formulated and delivered successful research proposals to funding agencies and led the resulting projects. Government projects: You have worked or are familiar with DARPA, IARPA, Service Labs, or other government research sponsors More Information Please visit “Why Carnegie Mellon” to learn more about becoming part of an institution inspiring innovations that change the world. A listing of employee benefits is available at: www.cmu.edu/jobs/benefits-at-a-glance/. Carnegie Mellon University is an Equal Opportunity Employer/Disability/Veteran.4
 
0.5%
We have an opportunity to join the Alliance as the Analytics Manager - Data Mart leading in the Analytics Services Department. WHAT YOU'LL BE RESPONSIBLE FOR Reporting to the Analytics Director, you will manage and lead the analytical data management function, including the gathering and assessment of business information needs for enterprise analytics and preparation of system requirements in order to create a single source of truth. Manage and lead the business information solution based Key Performance Indicator (KPI) dashboard reporting, customization, training and related integration with the Enterprise Data Warehouse (EDW). You will also manage, supervise, mentor and train assigned staff. ABOUT THE TEAM Our Analytics teams are skilled, focused, and highly collaborative. We work hard, have fun and take pride in how our work impacts the health outcomes for the communities we serve. THE IDEAL CANDIDATE WILL HAVE Passion and drive to roll up their sleeves and be a hands on manager Expertise and passion in the design, maintenance and evaluation of comprehensive analytical data marts Strength in developing data management plans, data dictionaries Experience and excitement for developing new teams and processes Strong communication skills and the ability to partner with cross-functional teams Ability to lead and inspire others, while promoting an environment that supports professional growth, embraces complex challenges and celebrates accomplishments WHAT YOU'LL NEED TO BE SUCCESSFUL To read the full position description, and list of requirements click here. Knowledge of: Data warehouse and analytical data mart concepts SQL Tools and techniques of data analysis and information reporting Thorough knowledge of information repository issues and concepts Joint Application Design (JAD) facilitation or other requirements-gathering techniques Relational database concepts and the creation of queries and reports using SQL and Tableau Data elements and their relationship to data quality requirements Ability to: Develop work plans and workflows and organize and prioritize analytical data mart activities Interpret and apply complex principles, policies, terms and procedures Define issues, interpret data, identify solutions, and make recommendations for action Independently document, summarize and resolve complex issues and projects Train, mentor, supervise, and evaluate the work of staff Education and Experience: Bachelor's degree in Computer Science, Information Science or a related field and seven years of experience performing data analysis and manipulation which included some experience leading or supervising staff; or an equivalent combination of education and experience may be qualifying OUR BENEFITS Medical, Dental and Vision Plans Ample Paid Time Off 11 Paid Holidays per year 401(a) Retirement Plan 457 Deferred Compensation Plan Robust Health and Wellness Program EV Charging Stations And many more ABOUT US We are a group of over 500 dedicated employees, committed to our mission of providing accessible, quality health care that is guided by local innovation. We feel that our work is bigger than ourselves. We leave work each day knowing that we made a difference in the community around us. Join us at Central California Alliance for Health (the Alliance), where you will be part of a culture that is respectful, diverse, professional and fun, and where you are empowered to do your best work. As a regional non-profit health plan, we serve over 330,000 members in Santa Cruz, Monterey and Merced counties. To learn more about us, click here or check out this video. At this time the Alliance does not provide any type of sponsorship. Applicants must be currently authorized to work in the United States on a full-time, ongoing basis without current or future needs for sponsorship.4
 
0.5%
As we strive to make a better day for our guests and team members, we look to enhance our enterprise applications dev team / master data efforts by adding someone with experience in Java. You will: 1. Develop solutions to support the initiative of moving our technology stack to the cloud 2. Maintain and develop solutions on SQL Server / PostgreSQL database leveraging tables, stored procedures, views, database roles, etc 3. Utilize a scripting language for automation of manual processes and manipulation/massage of data 4. Design solutions, document findings (gaps and risks), and communicate information and results to business partners in a concise and repeatable manner 5. Maintain up-to-date knowledge of industry standards for ETL tools and MDM technical solutions 6. Develop and maintain APIs using both MuleSoft and native EBX APIs Requirements: Java experience required. Experience with the Software Development Lifecycle (SDLC) required. Source control experience required. GITHUB, Subversion, or equivalent preferred Experience using query languages within relational database management systems (RDBMS). PostgreSQL and SQL Server are preferred. Python or shell scripting experience is a plus. .NET development experience is a plus. Release Management / Configuration Management / CICD experience a plus Experience with Maven, Jenkins, and SonarQube a plus Experience with large volumes of data using an established Enterprise Data Warehouse a plus Data extract, transform and load experience with an enterprise solution such as Informatica, SSIS, or Talend, is a plus. Experience using REST/SOAP APIs and MuleSoft experience a plus. Ability to troubleshoot and resolve issues independently is a plus. Attention to detail and strong problem solving skills desired. Ability to work as a member of a team to achieve stated goals. Job Type: Contract Experience: Java: 3 years (Required) SDLC: 2 years (Preferred) PostgreSQL and SQL: 2 years (Required) Location: Knoxville, TN (Required) Work authorization: United States (Required) Work Location: One location Benefits: Health insurance Schedule:: Monday to Friday4
 
0.5%
Day Shift: 7A-330P. Holidays and every other weekend. Summary: Performs, calculates and reports routine and special laboratory tests. Maintains equipment and troubleshoots problems. Evaluates results and quality control data. Serves as a resource and teaches new employees and students. Assists in evaluating new test procedures. The individual in this position must demonstrate knowledge of the principles of growth and development over the life span of the patient. In addition, she/he must possess the ability to assess patient data relative to age specific needs and provide care as described in the department's policies and procedures. Other information: Will consider entry level graduates of an approved associate degree program in an appropriate science field and eligibility as medical technologist.Must complete Point of Care Testing training as part of Department Orientation. Able to communicate effectively, pleasantly, cooperatively, and discretely with patients, physicians, hospital employees, and the general public. Able to work under pressure. Willingness to increase knowledge of laboratory/hospital functions. Demonstration of creativity, initiative and problem solving. Associate Degree in an appropriate science field and medical technologist certification or eligible. Responsibilities: Demonstrates technical knowledge and competence in performing expected responsibilities. Ability to perform laboratory skills. Performs and evaluates maintenance systems. Implements corrective action as appropriate. Performs and evaluates quality control data and implements corrective action. Ability to identify issues and processes requiring improvement. Ability to find, organize and use resources to improve outcomes. Monitors and evaluates training progress and makes recommendations for additional training. Assists in maintenance and ordering of supplies. Helps maintain organization and cleanliness of work/storage areas. Demonstrates the ability to function productively and independently, planning and prioritizing times and tasks to complete work assignments. Ability to maintain positive performance under a variety of conditions. Credentials: Essential: ASCP-MEDTECH - MLT OR MT Competencies and skills: Essential: Clear Communication Skills Both Written And Verbal Able To Keep Confidential Information Regarding Patients, Team Members Able To Withstand Crisis Situations Has Skills To Provides Customer Service To Patients, Team Members And Visitors Knowledge And Experience With Electronic Health Records Education: Essential: Associates Degree in related field Education specialization: Essential: Medical Technology Location: Millville All Services Shift : Flexible-hours/shifts may vary depending on department needs FTE: 0.500000 Work Status: Part Time >324
 
0.5%
ABL is seeking a Staff Scientist for the Downstream Process Development team. The candidate will be responsible for planning, development, and optimization, execution of assigned commercial and government client downstream development tasks. Working with external clients, R&D, Quality Control/ Quality Assurance and GMP Manufacturing, the scientist will provide expertise and scientific leadership for design, development, optimization, and production of protein therapeutics and viral vectors. The successful candidate will contribute to the team based execution of projects. The main responsibilities will include but not limited to follows: Develop robust, high-yield and scalable purification process (recombinant protein, virus and virus like particles) for Vaccine Development of Phase I/II candidates. Develop, optimize and scale-up protein purification methods to meet cGMP and Regulatory Compliance using Design of Experiment (DOE) methods. Lead efforts to evaluate different resins, filters, and analytical methods pertinent to purification development activities. Perform experiments using AKTA series Chromatography skids, TFF systems, and industry standard Harvest methods scale. Interacts with other departments involved in GMP manufacturing for planning production, testing and product release in a timely manner resulting in successful completion of projects. Participate in technology transfer of processes to manufacturing and from external clients, and from process development to manufacturing. Generate, manage, and maintain critical data in a highly organized manner in the form of notebook, protocol and SOP. Provide progress and developmental reports for assessment by clients. Develop and draft production batch records for GMP manufacturing, support and troubleshooting GMP production activities. Perform experiments and deliver results under minimal supervision, and within tight time lines, to a prescribed budget for internal / external client projects. Job Requirements This position requires a PhD in a life science discipline (Biochemistry, Analytical Chemistry Protein Chemistry or other related discipline), with 3-5 years of experience in Downstream Process Development, or an MS with 5-10 years’ experience, or a BS degree with more than 10 years of experience. Experience with cGMP manufacturing under cGMP/cGLP compliance a plus. Experience with AKTA purification systems. Computer skills using MS Office (Word, Excel, and Power Point). Proven leadership skills. Possess excellent interpersonal skills, both communications and written. Must be able to communicate effectively with all echelons of Management and staff. Task & Team-oriented, analytical, organized, detail-oriented, self-motivated & ability to multi-task. Travel Expectation None ABL, Inc. participates in E-Verify, an Internet-based system of the Department of Homeland Security (DHS) and Social Security Administration, that allows us to determine an employee's eligibility to work in the United States. ADDITIONAL INFORMATION: Candidate must meet all the requirements of our Company Occupational Health program as directed by the Occupational Health Consultant to include pre-employment physical and drug screen. Candidates are encouraged to submit a resume and a cover letter outlining background and experience as it relates to the position requirements and salary history/requirements. Please note that “negotiable” is neither salary nor requirements. Salary commensurate with experience. ABL, Inc. does not accept nor respond to unsolicited resumes from vendors, including recruitment agencies and search firms. Approved recruiting agencies must obtain prior approval from ABL, Inc. Human Resources in order to submit resumes to ABL, Inc. for consideration.4
 
0.5%
Who We Are! At Maven Wave, we are relentless in hiring the industry’s top talent. Each employee is hand-picked not only for their skills, but for their personality and broad expertise. We are looking for this rare combination of talent that sets us apart in the industry. Maven Wave helps leading companies make the shift to digital and shorten the fuse to innovation. We combine the expertise of top-tier consulting with the agility of a cutting-edge technology firm. This multidisciplinary blend of skills allows us to create unique digital advantages for our clients. Maven Wave’s digital solutions are agile, mobile, rooted in analytics, and built in the cloud. Maven Wave, Google, and YOU: Drive and deliver business results with data-based insights. We are looking for a Senior Data Scientist who will utilize their analytical, statistical, and programming skills to develop data-driven solutions to complex business challenges. Your Life As a Maven: Leverage company data to drive business solutions for enterprise clients using R and Python. Perform data collection for Data Science operations including Machine Learning. Develop custom data models and algorithms to apply to data sets. Use predictive modeling to increase and optimize customer experiences, revenue generation, ad targeting, and other business outcomes. Assess Model accuracy using common metrics (AUC, F1, etc.) and explain the results to client stakeholders. Your Expertise: 7+ years of experience manipulating data sets and building statistical models. Cloud experience in a major platform, such as AWS, GCP, or Azure. Experience using Data Science languages (R, Python) to manipulate data and draw insights. Knowledge of a variety of Machine Learning and advanced analytical techniques and their real world advantages/drawbacks. Familiarity with the following software/tools: Python, C, Java, Jupyter Notebooks, SQL, ML platforms (H2O, DataRobot), distributed data (Map/Reduce, Hadoop), and visualization (Tableau, qikview) Your X-Factor: Aptitude - You have an innate capacity to transition from project to project without skipping a beat. Communication - You have excellent written and verbal communication skills for coordination across projects and teams. Impact - You are a critical thinker with an emphasis on creativity and innovation. Passion - You have the drive to succeed paired with a continuous hunger to learn. Leadership - You are trusted, empathetic, accountable, and empower others around you. Why We’re Proud To Be Mavens! Google Cloud North America Services Partner of the Year 2019, 2018 #21 Best Workplaces in Chicago, FORTUNE, 2018 Great Place To Work Certification, Great Place to Work, 2017 & 2018 Fast Fifty, Crain's Chicago Business 101 Best and Brightest Companies to Work For, National Association for Business Resources (NABR) Top Google Cloud Partner, Clutch Fastest Growing Consulting Firms in North America (#11, #37), Consulting Magazine Top IT Services Companies, Clutch Google Global Rising Star Partner of the Year Ready to Learn More? Life as a Maven Check out the Apps and Data Team See what Glassdoor has to say Real Customer Stories3
 
0.4%
Other values (453)703
94.7%

Length

2022-10-02T10:04:08.208014image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
and23732
 
5.9%
to12880
 
3.2%
the9875
 
2.5%
of9471
 
2.4%
data7644
 
1.9%
in7083
 
1.8%
a6493
 
1.6%
with5980
 
1.5%
for4373
 
1.1%
experience3782
 
0.9%
Other values (13072)311377
77.3%

Most occurring characters

ValueCountFrequency (%)
378632
13.2%
e269480
 
9.4%
i198961
 
6.9%
a197301
 
6.9%
t194588
 
6.8%
n188423
 
6.6%
o168053
 
5.9%
s148718
 
5.2%
r148615
 
5.2%
l106053
 
3.7%
Other values (108)872379
30.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2280886
79.4%
Space Separator378647
 
13.2%
Uppercase Letter97486
 
3.4%
Other Punctuation59551
 
2.1%
Control30145
 
1.0%
Dash Punctuation9122
 
0.3%
Decimal Number7453
 
0.3%
Close Punctuation2611
 
0.1%
Open Punctuation2579
 
0.1%
Final Punctuation1294
 
< 0.1%
Other values (9)1429
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e269480
11.8%
i198961
 
8.7%
a197301
 
8.7%
t194588
 
8.5%
n188423
 
8.3%
o168053
 
7.4%
s148718
 
6.5%
r148615
 
6.5%
l106053
 
4.6%
c91769
 
4.0%
Other values (20)568925
24.9%
Uppercase Letter
ValueCountFrequency (%)
S10349
 
10.6%
A9041
 
9.3%
E7349
 
7.5%
D6544
 
6.7%
P6399
 
6.6%
T6206
 
6.4%
C6122
 
6.3%
I5365
 
5.5%
M5227
 
5.4%
R4459
 
4.6%
Other values (17)30425
31.2%
Other Punctuation
ValueCountFrequency (%)
,29385
49.3%
.17714
29.7%
/3506
 
5.9%
:3251
 
5.5%
'1172
 
2.0%
;886
 
1.5%
855
 
1.4%
&799
 
1.3%
*414
 
0.7%
?346
 
0.6%
Other values (9)1223
 
2.1%
Decimal Number
ValueCountFrequency (%)
01947
26.1%
21249
16.8%
11231
16.5%
5725
 
9.7%
3724
 
9.7%
4511
 
6.9%
7313
 
4.2%
8268
 
3.6%
6263
 
3.5%
9222
 
3.0%
Math Symbol
ValueCountFrequency (%)
+666
87.6%
>34
 
4.5%
=31
 
4.1%
|21
 
2.8%
~8
 
1.1%
Other Symbol
ValueCountFrequency (%)
85
58.6%
®39
26.9%
19
 
13.1%
©2
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
-8900
97.6%
163
 
1.8%
59
 
0.6%
Close Punctuation
ValueCountFrequency (%)
)2532
97.0%
]74
 
2.8%
}5
 
0.2%
Open Punctuation
ValueCountFrequency (%)
(2529
98.1%
[42
 
1.6%
{8
 
0.3%
Space Separator
ValueCountFrequency (%)
378632
> 99.9%
15
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
1187
91.7%
107
 
8.3%
Initial Punctuation
ValueCountFrequency (%)
105
92.1%
9
 
7.9%
Other Number
ValueCountFrequency (%)
9
81.8%
³2
 
18.2%
Control
ValueCountFrequency (%)
30145
100.0%
Connector Punctuation
ValueCountFrequency (%)
_322
100.0%
Currency Symbol
ValueCountFrequency (%)
$66
100.0%
Format
ValueCountFrequency (%)
­8
100.0%
Private Use
ValueCountFrequency (%)
2
100.0%
Other Letter
ValueCountFrequency (%)
º1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2378373
82.8%
Common492828
 
17.2%
Unknown2
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
378632
76.8%
30145
 
6.1%
,29385
 
6.0%
.17714
 
3.6%
-8900
 
1.8%
/3506
 
0.7%
:3251
 
0.7%
)2532
 
0.5%
(2529
 
0.5%
01947
 
0.4%
Other values (49)14287
 
2.9%
Latin
ValueCountFrequency (%)
e269480
11.3%
i198961
 
8.4%
a197301
 
8.3%
t194588
 
8.2%
n188423
 
7.9%
o168053
 
7.1%
s148718
 
6.3%
r148615
 
6.2%
l106053
 
4.5%
c91769
 
3.9%
Other values (48)666412
28.0%
Unknown
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII2868136
99.9%
Punctuation2523
 
0.1%
None425
 
< 0.1%
Geometric Shapes85
 
< 0.1%
Letterlike Symbols19
 
< 0.1%
Number Forms9
 
< 0.1%
Alphabetic PF4
 
< 0.1%
PUA2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
378632
13.2%
e269480
 
9.4%
i198961
 
6.9%
a197301
 
6.9%
t194588
 
6.8%
n188423
 
6.6%
o168053
 
5.9%
s148718
 
5.2%
r148615
 
5.2%
l106053
 
3.7%
Other values (83)869312
30.3%
Punctuation
ValueCountFrequency (%)
1187
47.0%
855
33.9%
163
 
6.5%
107
 
4.2%
105
 
4.2%
59
 
2.3%
23
 
0.9%
15
 
0.6%
9
 
0.4%
None
ValueCountFrequency (%)
·337
79.3%
®39
 
9.2%
Â18
 
4.2%
­8
 
1.9%
ï8
 
1.9%
é5
 
1.2%
§3
 
0.7%
©2
 
0.5%
³2
 
0.5%
è2
 
0.5%
Geometric Shapes
ValueCountFrequency (%)
85
100.0%
Letterlike Symbols
ValueCountFrequency (%)
19
100.0%
Number Forms
ValueCountFrequency (%)
9
100.0%
Alphabetic PF
ValueCountFrequency (%)
4
100.0%
PUA
ValueCountFrequency (%)
2
100.0%

Rating
Real number (ℝ)

HIGH CORRELATION

Distinct31
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.618867925
Minimum-1
Maximum5
Zeros0
Zeros (%)0.0%
Negative11
Negative (%)1.5%
Memory size5.9 KiB
2022-10-02T10:04:08.501774image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile2.6
Q13.3
median3.7
Q34
95-th percentile4.7
Maximum5
Range6
Interquartile range (IQR)0.7

Descriptive statistics

Standard deviation0.8012101585
Coefficient of variation (CV)0.2213980104
Kurtosis14.30412724
Mean3.618867925
Median Absolute Deviation (MAD)0.35
Skewness-2.814019554
Sum2685.2
Variance0.641937718
MonotonicityNot monotonic
2022-10-02T10:04:08.741132image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
3.963
 
8.5%
3.861
 
8.2%
3.761
 
8.2%
3.549
 
6.6%
447
 
6.3%
3.646
 
6.2%
3.444
 
5.9%
3.339
 
5.3%
3.235
 
4.7%
4.433
 
4.4%
Other values (21)264
35.6%
ValueCountFrequency (%)
-111
1.5%
1.93
 
0.4%
2.15
 
0.7%
2.22
 
0.3%
2.32
 
0.3%
2.47
0.9%
2.52
 
0.3%
2.612
1.6%
2.714
1.9%
2.87
0.9%
ValueCountFrequency (%)
55
 
0.7%
4.89
 
1.2%
4.731
4.2%
4.610
 
1.3%
4.57
 
0.9%
4.433
4.4%
4.332
4.3%
4.226
3.5%
4.119
2.6%
447
6.3%

Company Name
Categorical

HIGH CARDINALITY

Distinct343
Distinct (%)46.2%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
MassMutual 3.6
 
14
Reynolds American 3.1
 
14
Takeda Pharmaceuticals 3.7
 
14
Software Engineering Institute 2.6
 
11
PNNL 3.8
 
10
Other values (338)
679 

Length

Max length55
Median length37
Mean length19.18059299
Min length4

Characters and Unicode

Total characters14232
Distinct characters72
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique161 ?
Unique (%)21.7%

Sample

1st rowTecolote Research 3.8
2nd rowUniversity of Maryland Medical System 3.4
3rd rowKnowBe4 4.8
4th rowPNNL 3.8
5th rowAffinity Solutions 2.9

Common Values

ValueCountFrequency (%)
MassMutual 3.614
 
1.9%
Reynolds American 3.114
 
1.9%
Takeda Pharmaceuticals 3.714
 
1.9%
Software Engineering Institute 2.611
 
1.5%
PNNL 3.810
 
1.3%
Liberty Mutual Insurance 3.310
 
1.3%
AstraZeneca 3.99
 
1.2%
MITRE 3.28
 
1.1%
Numeric, LLC 3.27
 
0.9%
Advanced BioScience Laboratories 2.77
 
0.9%
Other values (333)638
86.0%

Length

2022-10-02T10:04:09.036065image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
3.963
 
2.8%
3.861
 
2.7%
3.761
 
2.7%
3.549
 
2.2%
4.047
 
2.1%
3.646
 
2.1%
3.444
 
2.0%
3.339
 
1.7%
health36
 
1.6%
3.235
 
1.6%
Other values (537)1759
78.5%

Most occurring characters

ValueCountFrequency (%)
e1019
 
7.2%
a891
 
6.3%
.768
 
5.4%
767
 
5.4%
731
 
5.1%
n688
 
4.8%
t685
 
4.8%
i677
 
4.8%
r657
 
4.6%
o631
 
4.4%
Other values (62)6718
47.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter8433
59.3%
Uppercase Letter1951
 
13.7%
Decimal Number1516
 
10.7%
Other Punctuation812
 
5.7%
Space Separator767
 
5.4%
Control731
 
5.1%
Dash Punctuation18
 
0.1%
Math Symbol4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e1019
12.1%
a891
10.6%
n688
 
8.2%
t685
 
8.1%
i677
 
8.0%
r657
 
7.8%
o631
 
7.5%
s596
 
7.1%
l408
 
4.8%
c408
 
4.8%
Other values (16)1773
21.0%
Uppercase Letter
ValueCountFrequency (%)
C192
 
9.8%
S178
 
9.1%
T151
 
7.7%
A148
 
7.6%
I135
 
6.9%
L122
 
6.3%
M117
 
6.0%
P111
 
5.7%
R109
 
5.6%
E94
 
4.8%
Other values (16)594
30.4%
Decimal Number
ValueCountFrequency (%)
3520
34.3%
4304
20.1%
2152
 
10.0%
7107
 
7.1%
986
 
5.7%
079
 
5.2%
878
 
5.1%
670
 
4.6%
563
 
4.2%
157
 
3.8%
Other Punctuation
ValueCountFrequency (%)
.768
94.6%
,21
 
2.6%
&13
 
1.6%
'8
 
1.0%
/2
 
0.2%
Math Symbol
ValueCountFrequency (%)
<2
50.0%
>2
50.0%
Space Separator
ValueCountFrequency (%)
767
100.0%
Control
ValueCountFrequency (%)
731
100.0%
Dash Punctuation
ValueCountFrequency (%)
-18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin10384
73.0%
Common3848
 
27.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e1019
 
9.8%
a891
 
8.6%
n688
 
6.6%
t685
 
6.6%
i677
 
6.5%
r657
 
6.3%
o631
 
6.1%
s596
 
5.7%
l408
 
3.9%
c408
 
3.9%
Other values (42)3724
35.9%
Common
ValueCountFrequency (%)
.768
20.0%
767
19.9%
731
19.0%
3520
13.5%
4304
 
7.9%
2152
 
4.0%
7107
 
2.8%
986
 
2.2%
079
 
2.1%
878
 
2.0%
Other values (10)256
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII14232
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e1019
 
7.2%
a891
 
6.3%
.768
 
5.4%
767
 
5.4%
731
 
5.1%
n688
 
4.8%
t685
 
4.8%
i677
 
4.8%
r657
 
4.6%
o631
 
4.4%
Other values (62)6718
47.2%

Location
Categorical

HIGH CARDINALITY

Distinct200
Distinct (%)27.0%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
New York, NY
55 
San Francisco, CA
 
49
Cambridge, MA
 
47
Chicago, IL
 
32
Boston, MA
 
23
Other values (195)
536 

Length

Max length33
Median length22
Mean length13.1509434
Min length8

Characters and Unicode

Total characters9758
Distinct characters54
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)9.6%

Sample

1st rowAlbuquerque, NM
2nd rowLinthicum, MD
3rd rowClearwater, FL
4th rowRichland, WA
5th rowNew York, NY

Common Values

ValueCountFrequency (%)
New York, NY55
 
7.4%
San Francisco, CA49
 
6.6%
Cambridge, MA47
 
6.3%
Chicago, IL32
 
4.3%
Boston, MA23
 
3.1%
San Jose, CA13
 
1.8%
Pittsburgh, PA12
 
1.6%
Washington, DC11
 
1.5%
Rockville, MD11
 
1.5%
Winston-Salem, NC10
 
1.3%
Other values (190)479
64.6%

Length

2022-10-02T10:04:09.321308image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ca152
 
8.8%
ma103
 
5.9%
san86
 
5.0%
ny72
 
4.2%
new57
 
3.3%
francisco57
 
3.3%
york55
 
3.2%
cambridge48
 
2.8%
va41
 
2.4%
il40
 
2.3%
Other values (256)1023
59.0%

Most occurring characters

ValueCountFrequency (%)
992
 
10.2%
,743
 
7.6%
a607
 
6.2%
o533
 
5.5%
n529
 
5.4%
e514
 
5.3%
i493
 
5.1%
A438
 
4.5%
r424
 
4.3%
l348
 
3.6%
Other values (44)4137
42.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter5525
56.6%
Uppercase Letter2488
25.5%
Space Separator992
 
10.2%
Other Punctuation743
 
7.6%
Dash Punctuation10
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A438
17.6%
C336
13.5%
N211
 
8.5%
M202
 
8.1%
S153
 
6.1%
Y133
 
5.3%
L106
 
4.3%
F93
 
3.7%
I89
 
3.6%
D87
 
3.5%
Other values (16)640
25.7%
Lowercase Letter
ValueCountFrequency (%)
a607
11.0%
o533
9.6%
n529
9.6%
e514
9.3%
i493
 
8.9%
r424
 
7.7%
l348
 
6.3%
t320
 
5.8%
s276
 
5.0%
c228
 
4.1%
Other values (15)1253
22.7%
Space Separator
ValueCountFrequency (%)
992
100.0%
Other Punctuation
ValueCountFrequency (%)
,743
100.0%
Dash Punctuation
ValueCountFrequency (%)
-10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin8013
82.1%
Common1745
 
17.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a607
 
7.6%
o533
 
6.7%
n529
 
6.6%
e514
 
6.4%
i493
 
6.2%
A438
 
5.5%
r424
 
5.3%
l348
 
4.3%
C336
 
4.2%
t320
 
4.0%
Other values (41)3471
43.3%
Common
ValueCountFrequency (%)
992
56.8%
,743
42.6%
-10
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII9758
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
992
 
10.2%
,743
 
7.6%
a607
 
6.2%
o533
 
5.5%
n529
 
5.4%
e514
 
5.3%
i493
 
5.1%
A438
 
4.5%
r424
 
4.3%
l348
 
3.6%
Other values (44)4137
42.4%

Headquarters
Categorical

HIGH CARDINALITY

Distinct198
Distinct (%)26.7%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
New York, NY
 
52
San Francisco, CA
 
42
Chicago, IL
 
30
Cambridge, MA
 
20
Springfield, MA
 
14
Other values (193)
584 

Length

Max length26
Median length22
Mean length13.606469
Min length2

Characters and Unicode

Total characters10096
Distinct characters55
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)9.4%

Sample

1st rowGoleta, CA
2nd rowBaltimore, MD
3rd rowClearwater, FL
4th rowRichland, WA
5th rowNew York, NY

Common Values

ValueCountFrequency (%)
New York, NY52
 
7.0%
San Francisco, CA42
 
5.7%
Chicago, IL30
 
4.0%
Cambridge, MA20
 
2.7%
Springfield, MA14
 
1.9%
Boston, MA14
 
1.9%
Winston-Salem, NC14
 
1.9%
OSAKA, Japan14
 
1.9%
Richland, WA12
 
1.6%
Reston, VA12
 
1.6%
Other values (188)518
69.8%

Length

2022-10-02T10:04:09.568227image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ca169
 
9.5%
ma86
 
4.8%
san76
 
4.3%
ny63
 
3.5%
new55
 
3.1%
va53
 
3.0%
york52
 
2.9%
francisco46
 
2.6%
il34
 
1.9%
chicago30
 
1.7%
Other values (261)1111
62.6%

Most occurring characters

ValueCountFrequency (%)
1033
 
10.2%
,741
 
7.3%
a647
 
6.4%
n615
 
6.1%
e560
 
5.5%
o528
 
5.2%
i510
 
5.1%
A459
 
4.5%
r422
 
4.2%
l370
 
3.7%
Other values (45)4211
41.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter5779
57.2%
Uppercase Letter2525
25.0%
Space Separator1033
 
10.2%
Other Punctuation741
 
7.3%
Dash Punctuation17
 
0.2%
Decimal Number1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A459
18.2%
C350
13.9%
N192
 
7.6%
S181
 
7.2%
M178
 
7.0%
Y115
 
4.6%
L107
 
4.2%
F102
 
4.0%
I82
 
3.2%
P80
 
3.2%
Other values (16)679
26.9%
Lowercase Letter
ValueCountFrequency (%)
a647
11.2%
n615
10.6%
e560
9.7%
o528
9.1%
i510
 
8.8%
r422
 
7.3%
l370
 
6.4%
t348
 
6.0%
s259
 
4.5%
d237
 
4.1%
Other values (15)1283
22.2%
Space Separator
ValueCountFrequency (%)
1033
100.0%
Other Punctuation
ValueCountFrequency (%)
,741
100.0%
Dash Punctuation
ValueCountFrequency (%)
-17
100.0%
Decimal Number
ValueCountFrequency (%)
11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin8304
82.3%
Common1792
 
17.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a647
 
7.8%
n615
 
7.4%
e560
 
6.7%
o528
 
6.4%
i510
 
6.1%
A459
 
5.5%
r422
 
5.1%
l370
 
4.5%
C350
 
4.2%
t348
 
4.2%
Other values (41)3495
42.1%
Common
ValueCountFrequency (%)
1033
57.6%
,741
41.4%
-17
 
0.9%
11
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII10096
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1033
 
10.2%
,741
 
7.3%
a647
 
6.4%
n615
 
6.1%
e560
 
5.5%
o528
 
5.2%
i510
 
5.1%
A459
 
4.5%
r422
 
4.2%
l370
 
3.7%
Other values (45)4211
41.7%

Size
Categorical

HIGH CORRELATION

Distinct8
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
1001 - 5000
150 
501 - 1000
134 
10000+
130 
201 - 500
117 
51 - 200
94 
Other values (3)
117 

Length

Max length13
Median length11
Mean length10.07412399
Min length7

Characters and Unicode

Total characters7475
Distinct characters12
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row501 - 1000
2nd row10000+
3rd row501 - 1000
4th row1001 - 5000
5th row51 - 200

Common Values

ValueCountFrequency (%)
1001 - 5000 150
20.2%
501 - 1000 134
18.1%
10000+ 130
17.5%
201 - 500 117
15.8%
51 - 200 94
12.7%
5001 - 10000 76
10.2%
1 - 50 31
 
4.2%
unknown10
 
1.3%

Length

2022-10-02T10:04:09.841456image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:10.216236image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
602
30.9%
10000206
 
10.6%
1001150
 
7.7%
5000150
 
7.7%
501134
 
6.9%
1000134
 
6.9%
201117
 
6.0%
500117
 
6.0%
5194
 
4.8%
20094
 
4.8%
Other values (4)148
 
7.6%

Most occurring characters

ValueCountFrequency (%)
02832
37.9%
1936
25.9%
11092
 
14.6%
-602
 
8.1%
5602
 
8.1%
2211
 
2.8%
+130
 
1.7%
n30
 
0.4%
u10
 
0.1%
k10
 
0.1%
Other values (2)20
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number4737
63.4%
Space Separator1936
25.9%
Dash Punctuation602
 
8.1%
Math Symbol130
 
1.7%
Lowercase Letter70
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n30
42.9%
u10
 
14.3%
k10
 
14.3%
o10
 
14.3%
w10
 
14.3%
Decimal Number
ValueCountFrequency (%)
02832
59.8%
11092
 
23.1%
5602
 
12.7%
2211
 
4.5%
Space Separator
ValueCountFrequency (%)
1936
100.0%
Dash Punctuation
ValueCountFrequency (%)
-602
100.0%
Math Symbol
ValueCountFrequency (%)
+130
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common7405
99.1%
Latin70
 
0.9%

Most frequent character per script

Common
ValueCountFrequency (%)
02832
38.2%
1936
26.1%
11092
 
14.7%
-602
 
8.1%
5602
 
8.1%
2211
 
2.8%
+130
 
1.8%
Latin
ValueCountFrequency (%)
n30
42.9%
u10
 
14.3%
k10
 
14.3%
o10
 
14.3%
w10
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII7475
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
02832
37.9%
1936
25.9%
11092
 
14.6%
-602
 
8.1%
5602
 
8.1%
2211
 
2.8%
+130
 
1.7%
n30
 
0.4%
u10
 
0.1%
k10
 
0.1%
Other values (2)20
 
0.3%

Founded
Real number (ℝ)

HIGH CORRELATION

Distinct102
Distinct (%)13.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1837.154987
Minimum-1
Maximum2019
Zeros0
Zeros (%)0.0%
Negative50
Negative (%)6.7%
Memory size5.9 KiB
2022-10-02T10:04:10.550321image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile-1
Q11939
median1988
Q32007
95-th percentile2014
Maximum2019
Range2020
Interquartile range (IQR)68

Descriptive statistics

Standard deviation497.1837627
Coefficient of variation (CV)0.270627011
Kurtosis9.705374859
Mean1837.154987
Median Absolute Deviation (MAD)22
Skewness-3.394532023
Sum1363169
Variance247191.6939
MonotonicityNot monotonic
2022-10-02T10:04:10.853505image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-150
 
6.7%
201032
 
4.3%
200831
 
4.2%
199627
 
3.6%
200624
 
3.2%
201221
 
2.8%
201119
 
2.6%
195818
 
2.4%
200718
 
2.4%
198418
 
2.4%
Other values (92)484
65.2%
ValueCountFrequency (%)
-150
6.7%
17441
 
0.1%
178114
 
1.9%
18121
 
0.1%
18304
 
0.5%
18462
 
0.3%
18497
 
0.9%
18501
 
0.1%
185114
 
1.9%
18525
 
0.7%
ValueCountFrequency (%)
20192
 
0.3%
201712
 
1.6%
20165
 
0.7%
201516
2.2%
201413
1.8%
201315
2.0%
201221
2.8%
201119
2.6%
201032
4.3%
20096
 
0.8%

Type of ownership
Categorical

HIGH CORRELATION

Distinct9
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Company - Private
410 
Company - Public
193 
Nonprofit Organization
55 
Subsidiary or Business Segment
 
34
Government
 
15
Other values (4)
 
35

Length

Max length30
Median length17
Mean length17.46091644
Min length8

Characters and Unicode

Total characters12956
Distinct characters34
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCompany - Private
2nd rowOther Organization
3rd rowCompany - Private
4th rowGovernment
5th rowCompany - Private

Common Values

ValueCountFrequency (%)
Company - Private410
55.3%
Company - Public193
26.0%
Nonprofit Organization55
 
7.4%
Subsidiary or Business Segment34
 
4.6%
Government15
 
2.0%
Hospital15
 
2.0%
College / University13
 
1.8%
Other Organization5
 
0.7%
School / School District2
 
0.3%

Length

2022-10-02T10:04:11.131784image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:11.435915image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
618
28.9%
company603
28.2%
private410
19.1%
public193
 
9.0%
organization60
 
2.8%
nonprofit55
 
2.6%
segment34
 
1.6%
business34
 
1.6%
or34
 
1.6%
subsidiary34
 
1.6%
Other values (7)67
 
3.1%

Most occurring characters

ValueCountFrequency (%)
1400
 
10.8%
a1182
 
9.1%
i925
 
7.1%
n889
 
6.9%
o858
 
6.6%
p673
 
5.2%
m652
 
5.0%
y650
 
5.0%
r628
 
4.8%
C616
 
4.8%
Other values (24)4483
34.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter9448
72.9%
Uppercase Letter1490
 
11.5%
Space Separator1400
 
10.8%
Dash Punctuation603
 
4.7%
Other Punctuation15
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a1182
12.5%
i925
9.8%
n889
9.4%
o858
9.1%
p673
 
7.1%
m652
 
6.9%
y650
 
6.9%
r628
 
6.6%
t611
 
6.5%
e586
 
6.2%
Other values (11)1794
19.0%
Uppercase Letter
ValueCountFrequency (%)
C616
41.3%
P603
40.5%
S72
 
4.8%
O65
 
4.4%
N55
 
3.7%
B34
 
2.3%
G15
 
1.0%
H15
 
1.0%
U13
 
0.9%
D2
 
0.1%
Space Separator
ValueCountFrequency (%)
1400
100.0%
Dash Punctuation
ValueCountFrequency (%)
-603
100.0%
Other Punctuation
ValueCountFrequency (%)
/15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin10938
84.4%
Common2018
 
15.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a1182
 
10.8%
i925
 
8.5%
n889
 
8.1%
o858
 
7.8%
p673
 
6.2%
m652
 
6.0%
y650
 
5.9%
r628
 
5.7%
C616
 
5.6%
t611
 
5.6%
Other values (21)3254
29.7%
Common
ValueCountFrequency (%)
1400
69.4%
-603
29.9%
/15
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII12956
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1400
 
10.8%
a1182
 
9.1%
i925
 
7.1%
n889
 
6.9%
o858
 
6.6%
p673
 
5.2%
m652
 
5.0%
y650
 
5.0%
r628
 
4.8%
C616
 
4.8%
Other values (24)4483
34.6%

Industry
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct60
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Biotech & Pharmaceuticals
112 
Insurance Carriers
63 
Computer Hardware & Software
59 
IT Services
50 
Health Care Services & Hospitals
49 
Other values (55)
409 

Length

Max length40
Median length35
Mean length21.9083558
Min length2

Characters and Unicode

Total characters16256
Distinct characters52
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)1.5%

Sample

1st rowAerospace & Defense
2nd rowHealth Care Services & Hospitals
3rd rowSecurity Services
4th rowEnergy
5th rowAdvertising & Marketing

Common Values

ValueCountFrequency (%)
Biotech & Pharmaceuticals112
15.1%
Insurance Carriers63
 
8.5%
Computer Hardware & Software59
 
8.0%
IT Services50
 
6.7%
Health Care Services & Hospitals49
 
6.6%
Enterprise Software & Network Solutions42
 
5.7%
Internet29
 
3.9%
Consulting29
 
3.9%
Aerospace & Defense25
 
3.4%
Advertising & Marketing25
 
3.4%
Other values (50)259
34.9%

Length

2022-10-02T10:04:11.787251image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
416
19.4%
services120
 
5.6%
biotech112
 
5.2%
pharmaceuticals112
 
5.2%
software101
 
4.7%
insurance69
 
3.2%
carriers63
 
2.9%
computer59
 
2.7%
hardware59
 
2.7%
health51
 
2.4%
Other values (101)986
45.9%

Most occurring characters

ValueCountFrequency (%)
e1784
 
11.0%
1406
 
8.6%
r1340
 
8.2%
a1218
 
7.5%
t1048
 
6.4%
i982
 
6.0%
s969
 
6.0%
n848
 
5.2%
c763
 
4.7%
o746
 
4.6%
Other values (42)5152
31.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter12614
77.6%
Uppercase Letter1774
 
10.9%
Space Separator1406
 
8.6%
Other Punctuation430
 
2.6%
Decimal Number18
 
0.1%
Dash Punctuation14
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e1784
14.1%
r1340
10.6%
a1218
9.7%
t1048
8.3%
i982
7.8%
s969
7.7%
n848
 
6.7%
c763
 
6.0%
o746
 
5.9%
u501
 
4.0%
Other values (15)2415
19.1%
Uppercase Letter
ValueCountFrequency (%)
S313
17.6%
C267
15.1%
H159
9.0%
I157
8.9%
B151
8.5%
P143
8.1%
A98
 
5.5%
E79
 
4.5%
T78
 
4.4%
M73
 
4.1%
Other values (11)256
14.4%
Other Punctuation
ValueCountFrequency (%)
&416
96.7%
,14
 
3.3%
Decimal Number
ValueCountFrequency (%)
114
77.8%
24
 
22.2%
Space Separator
ValueCountFrequency (%)
1406
100.0%
Dash Punctuation
ValueCountFrequency (%)
-14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin14388
88.5%
Common1868
 
11.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e1784
12.4%
r1340
 
9.3%
a1218
 
8.5%
t1048
 
7.3%
i982
 
6.8%
s969
 
6.7%
n848
 
5.9%
c763
 
5.3%
o746
 
5.2%
u501
 
3.5%
Other values (36)4189
29.1%
Common
ValueCountFrequency (%)
1406
75.3%
&416
 
22.3%
,14
 
0.7%
114
 
0.7%
-14
 
0.7%
24
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII16256
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e1784
 
11.0%
1406
 
8.6%
r1340
 
8.2%
a1218
 
7.5%
t1048
 
6.4%
i982
 
6.0%
s969
 
6.0%
n848
 
5.2%
c763
 
4.7%
o746
 
4.6%
Other values (42)5152
31.7%

Sector
Categorical

HIGH CORRELATION

Distinct25
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Information Technology
180 
Biotech & Pharmaceuticals
112 
Business Services
97 
Insurance
69 
Health Care
49 
Other values (20)
235 

Length

Max length34
Median length28
Mean length17.02695418
Min length2

Characters and Unicode

Total characters12634
Distinct characters42
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st rowAerospace & Defense
2nd rowHealth Care
3rd rowBusiness Services
4th rowOil, Gas, Energy & Utilities
5th rowBusiness Services

Common Values

ValueCountFrequency (%)
Information Technology180
24.3%
Biotech & Pharmaceuticals112
15.1%
Business Services97
13.1%
Insurance69
 
9.3%
Health Care49
 
6.6%
Finance42
 
5.7%
Manufacturing34
 
4.6%
Aerospace & Defense25
 
3.4%
Education23
 
3.1%
Retail15
 
2.0%
Other values (15)96
12.9%

Length

2022-10-02T10:04:12.087753image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
information180
12.2%
technology180
12.2%
179
12.2%
biotech112
 
7.6%
pharmaceuticals112
 
7.6%
services101
 
6.9%
business97
 
6.6%
insurance69
 
4.7%
health49
 
3.3%
care49
 
3.3%
Other values (34)345
23.4%

Most occurring characters

ValueCountFrequency (%)
e1179
 
9.3%
n1091
 
8.6%
o969
 
7.7%
a943
 
7.5%
i856
 
6.8%
c843
 
6.7%
731
 
5.8%
s712
 
5.6%
r662
 
5.2%
t654
 
5.2%
Other values (32)3994
31.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter10367
82.1%
Uppercase Letter1293
 
10.2%
Space Separator731
 
5.8%
Other Punctuation214
 
1.7%
Dash Punctuation19
 
0.2%
Decimal Number10
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e1179
11.4%
n1091
10.5%
o969
9.3%
a943
9.1%
i856
8.3%
c843
8.1%
s712
 
6.9%
r662
 
6.4%
t654
 
6.3%
h453
 
4.4%
Other values (9)2005
19.3%
Uppercase Letter
ValueCountFrequency (%)
I249
19.3%
T210
16.2%
B209
16.2%
P121
9.4%
S101
7.8%
C56
 
4.3%
E49
 
3.8%
H49
 
3.8%
M49
 
3.8%
F43
 
3.3%
Other values (8)157
12.1%
Other Punctuation
ValueCountFrequency (%)
&179
83.6%
,35
 
16.4%
Space Separator
ValueCountFrequency (%)
731
100.0%
Dash Punctuation
ValueCountFrequency (%)
-19
100.0%
Decimal Number
ValueCountFrequency (%)
110
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin11660
92.3%
Common974
 
7.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e1179
 
10.1%
n1091
 
9.4%
o969
 
8.3%
a943
 
8.1%
i856
 
7.3%
c843
 
7.2%
s712
 
6.1%
r662
 
5.7%
t654
 
5.6%
h453
 
3.9%
Other values (27)3298
28.3%
Common
ValueCountFrequency (%)
731
75.1%
&179
 
18.4%
,35
 
3.6%
-19
 
2.0%
110
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII12634
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e1179
 
9.3%
n1091
 
8.6%
o969
 
7.7%
a943
 
7.5%
i856
 
6.8%
c843
 
6.7%
731
 
5.8%
s712
 
5.6%
r662
 
5.2%
t654
 
5.2%
Other values (32)3994
31.6%

Revenue
Categorical

HIGH CORRELATION

Distinct13
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Unknown / Non-Applicable
204 
$10+ billion (USD)
124 
$100 to $500 million (USD)
91 
$1 to $2 billion (USD)
60 
$500 million to $1 billion (USD)
57 
Other values (8)
206 

Length

Max length32
Median length26
Mean length23.5916442
Min length18

Characters and Unicode

Total characters17505
Distinct characters32
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row$50 to $100 million (USD)
2nd row$2 to $5 billion (USD)
3rd row$100 to $500 million (USD)
4th row$500 million to $1 billion (USD)
5th rowUnknown / Non-Applicable

Common Values

ValueCountFrequency (%)
Unknown / Non-Applicable204
27.5%
$10+ billion (USD)124
16.7%
$100 to $500 million (USD)91
12.3%
$1 to $2 billion (USD)60
 
8.1%
$500 million to $1 billion (USD)57
 
7.7%
$50 to $100 million (USD)46
 
6.2%
$25 to $50 million (USD)40
 
5.4%
$2 to $5 billion (USD)39
 
5.3%
$10 to $25 million (USD)32
 
4.3%
$5 to $10 billion (USD)19
 
2.6%
Other values (3)30
 
4.0%

Length

2022-10-02T10:04:12.342433image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
usd538
17.3%
to410
13.2%
billion299
9.6%
million296
9.5%
unknown204
 
6.6%
204
 
6.6%
non-applicable204
 
6.6%
10193
 
6.2%
500148
 
4.8%
100137
 
4.4%
Other values (7)478
15.4%

Most occurring characters

ValueCountFrequency (%)
2369
13.5%
l1598
 
9.1%
n1415
 
8.1%
o1413
 
8.1%
i1394
 
8.0%
$948
 
5.4%
0849
 
4.9%
U742
 
4.2%
D538
 
3.1%
)538
 
3.1%
Other values (22)5701
32.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter8481
48.4%
Space Separator2369
 
13.5%
Uppercase Letter2230
 
12.7%
Decimal Number1869
 
10.7%
Currency Symbol948
 
5.4%
Close Punctuation538
 
3.1%
Open Punctuation538
 
3.1%
Dash Punctuation204
 
1.2%
Other Punctuation204
 
1.2%
Math Symbol124
 
0.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l1598
18.8%
n1415
16.7%
o1413
16.7%
i1394
16.4%
b503
 
5.9%
t414
 
4.9%
p408
 
4.8%
m296
 
3.5%
e208
 
2.5%
a208
 
2.5%
Other values (5)624
 
7.4%
Uppercase Letter
ValueCountFrequency (%)
U742
33.3%
D538
24.1%
S538
24.1%
A204
 
9.1%
N204
 
9.1%
L4
 
0.2%
Decimal Number
ValueCountFrequency (%)
0849
45.4%
1459
24.6%
5390
20.9%
2171
 
9.1%
Space Separator
ValueCountFrequency (%)
2369
100.0%
Currency Symbol
ValueCountFrequency (%)
$948
100.0%
Close Punctuation
ValueCountFrequency (%)
)538
100.0%
Open Punctuation
ValueCountFrequency (%)
(538
100.0%
Dash Punctuation
ValueCountFrequency (%)
-204
100.0%
Other Punctuation
ValueCountFrequency (%)
/204
100.0%
Math Symbol
ValueCountFrequency (%)
+124
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin10711
61.2%
Common6794
38.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
l1598
14.9%
n1415
13.2%
o1413
13.2%
i1394
13.0%
U742
6.9%
D538
 
5.0%
S538
 
5.0%
b503
 
4.7%
t414
 
3.9%
p408
 
3.8%
Other values (11)1748
16.3%
Common
ValueCountFrequency (%)
2369
34.9%
$948
14.0%
0849
 
12.5%
)538
 
7.9%
(538
 
7.9%
1459
 
6.8%
5390
 
5.7%
-204
 
3.0%
/204
 
3.0%
2171
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII17505
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2369
13.5%
l1598
 
9.1%
n1415
 
8.1%
o1413
 
8.1%
i1394
 
8.0%
$948
 
5.4%
0849
 
4.9%
U742
 
4.2%
D538
 
3.1%
)538
 
3.1%
Other values (22)5701
32.6%

Competitors
Categorical

HIGH CARDINALITY

Distinct128
Distinct (%)17.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
-1
460 
Novartis, Baxter, Pfizer
 
14
Oak Ridge National Laboratory, National Renewable Energy Lab, Los Alamos National Laboratory
 
12
Travelers, Allstate, State Farm
 
10
Roche, GlaxoSmithKline, Novartis
 
9
Other values (123)
237 

Length

Max length92
Median length2
Mean length15.97439353
Min length2

Characters and Unicode

Total characters11853
Distinct characters63
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)7.3%

Sample

1st row-1
2nd row-1
3rd row-1
4th rowOak Ridge National Laboratory, National Renewable Energy Lab, Los Alamos National Laboratory
5th rowCommerce Signals, Cardlytics, Yodlee

Common Values

ValueCountFrequency (%)
-1460
62.0%
Novartis, Baxter, Pfizer14
 
1.9%
Oak Ridge National Laboratory, National Renewable Energy Lab, Los Alamos National Laboratory12
 
1.6%
Travelers, Allstate, State Farm10
 
1.3%
Roche, GlaxoSmithKline, Novartis9
 
1.2%
Battelle, General Atomics, SAIC8
 
1.1%
Expedia Group, Orbitz Worldwide, Priceline.com7
 
0.9%
Pitney Bowes6
 
0.8%
Leidos, CACI International, Booz Allen Hamilton6
 
0.8%
FLURRY, Chartboost6
 
0.8%
Other values (118)204
27.5%

Length

2022-10-02T10:04:12.642235image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1460
 
25.5%
national44
 
2.4%
laboratory28
 
1.6%
novartis25
 
1.4%
pfizer18
 
1.0%
group18
 
1.0%
16
 
0.9%
los15
 
0.8%
alamos15
 
0.8%
glaxosmithkline15
 
0.8%
Other values (433)1152
63.8%

Most occurring characters

ValueCountFrequency (%)
1064
 
9.0%
e940
 
7.9%
a822
 
6.9%
o672
 
5.7%
t654
 
5.5%
r634
 
5.3%
i626
 
5.3%
n557
 
4.7%
,500
 
4.2%
l490
 
4.1%
Other values (53)4894
41.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter7627
64.3%
Uppercase Letter1682
 
14.2%
Space Separator1064
 
9.0%
Other Punctuation541
 
4.6%
Dash Punctuation467
 
3.9%
Decimal Number462
 
3.9%
Math Symbol4
 
< 0.1%
Open Punctuation3
 
< 0.1%
Close Punctuation3
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S156
 
9.3%
A156
 
9.3%
C144
 
8.6%
N118
 
7.0%
L113
 
6.7%
T99
 
5.9%
R94
 
5.6%
I86
 
5.1%
B86
 
5.1%
M74
 
4.4%
Other values (16)556
33.1%
Lowercase Letter
ValueCountFrequency (%)
e940
12.3%
a822
10.8%
o672
8.8%
t654
8.6%
r634
8.3%
i626
8.2%
n557
 
7.3%
l490
 
6.4%
s400
 
5.2%
c289
 
3.8%
Other values (15)1543
20.2%
Other Punctuation
ValueCountFrequency (%)
,500
92.4%
.19
 
3.5%
&13
 
2.4%
'9
 
1.7%
Decimal Number
ValueCountFrequency (%)
1460
99.6%
92
 
0.4%
Math Symbol
ValueCountFrequency (%)
|2
50.0%
+2
50.0%
Space Separator
ValueCountFrequency (%)
1064
100.0%
Dash Punctuation
ValueCountFrequency (%)
-467
100.0%
Open Punctuation
ValueCountFrequency (%)
(3
100.0%
Close Punctuation
ValueCountFrequency (%)
)3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin9309
78.5%
Common2544
 
21.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e940
 
10.1%
a822
 
8.8%
o672
 
7.2%
t654
 
7.0%
r634
 
6.8%
i626
 
6.7%
n557
 
6.0%
l490
 
5.3%
s400
 
4.3%
c289
 
3.1%
Other values (41)3225
34.6%
Common
ValueCountFrequency (%)
1064
41.8%
,500
19.7%
-467
18.4%
1460
18.1%
.19
 
0.7%
&13
 
0.5%
'9
 
0.4%
(3
 
0.1%
)3
 
0.1%
|2
 
0.1%
Other values (2)4
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII11853
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1064
 
9.0%
e940
 
7.9%
a822
 
6.9%
o672
 
5.7%
t654
 
5.5%
r634
 
5.3%
i626
 
5.3%
n557
 
4.7%
,500
 
4.2%
l490
 
4.1%
Other values (53)4894
41.3%

Hourly
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
718 
1
 
24

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0718
96.8%
124
 
3.2%

Length

2022-10-02T10:04:12.895002image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:13.132467image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0718
96.8%
124
 
3.2%

Most occurring characters

ValueCountFrequency (%)
0718
96.8%
124
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0718
96.8%
124
 
3.2%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0718
96.8%
124
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0718
96.8%
124
 
3.2%

Employer provided
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
725 
1
 
17

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0725
97.7%
117
 
2.3%

Length

2022-10-02T10:04:13.321229image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:13.563189image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0725
97.7%
117
 
2.3%

Most occurring characters

ValueCountFrequency (%)
0725
97.7%
117
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0725
97.7%
117
 
2.3%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0725
97.7%
117
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0725
97.7%
117
 
2.3%

Lower Salary
Real number (ℝ≥0)

HIGH CORRELATION

Distinct113
Distinct (%)15.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.75471698
Minimum15
Maximum202
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2022-10-02T10:04:13.800655image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile35.05
Q152
median69.5
Q391
95-th percentile127
Maximum202
Range187
Interquartile range (IQR)39

Descriptive statistics

Standard deviation30.94589241
Coefficient of variation (CV)0.4139657491
Kurtosis1.967121091
Mean74.75471698
Median Absolute Deviation (MAD)19.5
Skewness1.113064296
Sum55468
Variance957.6482571
MonotonicityNot monotonic
2022-10-02T10:04:14.090568image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4322
 
3.0%
6520
 
2.7%
6118
 
2.4%
8018
 
2.4%
5218
 
2.4%
4918
 
2.4%
8117
 
2.3%
7416
 
2.2%
6316
 
2.2%
5616
 
2.2%
Other values (103)563
75.9%
ValueCountFrequency (%)
151
 
0.1%
203
 
0.4%
261
 
0.1%
272
 
0.3%
291
 
0.1%
318
1.1%
326
0.8%
333
 
0.4%
345
0.7%
358
1.1%
ValueCountFrequency (%)
2023
0.4%
2003
0.4%
1903
0.4%
1761
 
0.1%
1711
 
0.1%
1582
 
0.3%
1507
0.9%
1393
0.4%
1383
0.4%
1361
 
0.1%

Upper Salary
Real number (ℝ≥0)

HIGH CORRELATION

Distinct162
Distinct (%)21.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.2142857
Minimum16
Maximum306
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2022-10-02T10:04:14.390010image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile62
Q196
median124
Q3155
95-th percentile208
Maximum306
Range290
Interquartile range (IQR)59

Descriptive statistics

Standard deviation45.12864994
Coefficient of variation (CV)0.3519783282
Kurtosis0.6122000821
Mean128.2142857
Median Absolute Deviation (MAD)29
Skewness0.6330608288
Sum95135
Variance2036.595045
MonotonicityNot monotonic
2022-10-02T10:04:14.658847image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14016
 
2.2%
11915
 
2.0%
11015
 
2.0%
12415
 
2.0%
11313
 
1.8%
12713
 
1.8%
8612
 
1.6%
17312
 
1.6%
10112
 
1.6%
13911
 
1.5%
Other values (152)608
81.9%
ValueCountFrequency (%)
161
 
0.1%
352
 
0.3%
391
 
0.1%
482
 
0.3%
492
 
0.3%
501
 
0.1%
525
0.7%
553
0.4%
573
0.4%
583
0.4%
ValueCountFrequency (%)
3063
0.4%
2891
 
0.1%
2751
 
0.1%
2721
 
0.1%
2502
0.3%
2392
0.3%
2382
0.3%
2311
 
0.1%
2282
0.3%
2243
0.4%

Avg Salary(K)
Real number (ℝ≥0)

HIGH CORRELATION

Distinct219
Distinct (%)29.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean101.4845013
Minimum15.5
Maximum254
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2022-10-02T10:04:14.959657image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum15.5
5-th percentile50
Q173.5
median97.5
Q3122.5
95-th percentile167.5
Maximum254
Range238.5
Interquartile range (IQR)49

Descriptive statistics

Standard deviation37.48244883
Coefficient of variation (CV)0.3693416071
Kurtosis0.9866909301
Mean101.4845013
Median Absolute Deviation (MAD)24.5
Skewness0.7964180441
Sum75301.5
Variance1404.93397
MonotonicityNot monotonic
2022-10-02T10:04:15.250874image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
87.512
 
1.6%
14011
 
1.5%
8111
 
1.5%
8510
 
1.3%
107.510
 
1.3%
56.510
 
1.3%
84.510
 
1.3%
10710
 
1.3%
879
 
1.2%
1209
 
1.2%
Other values (209)640
86.3%
ValueCountFrequency (%)
15.51
0.1%
27.52
0.3%
29.51
0.1%
37.52
0.3%
39.51
0.1%
40.51
0.1%
41.51
0.1%
422
0.3%
432
0.3%
442
0.3%
ValueCountFrequency (%)
2543
0.4%
237.51
 
0.1%
232.51
 
0.1%
2252
0.3%
221.51
 
0.1%
2053
0.4%
194.52
0.3%
1942
0.3%
184.52
0.3%
1813
0.4%

company_txt
Categorical

HIGH CARDINALITY

Distinct343
Distinct (%)46.2%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
MassMutual
 
14
Reynolds American
 
14
Takeda Pharmaceuticals
 
14
Software Engineering Institute
 
11
PNNL
 
10
Other values (338)
679 

Length

Max length51
Median length33
Mean length15.22506739
Min length2

Characters and Unicode

Total characters11297
Distinct characters70
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique161 ?
Unique (%)21.7%

Sample

1st rowTecolote Research
2nd rowUniversity of Maryland Medical System
3rd rowKnowBe4
4th rowPNNL
5th rowAffinity Solutions

Common Values

ValueCountFrequency (%)
MassMutual14
 
1.9%
Reynolds American14
 
1.9%
Takeda Pharmaceuticals14
 
1.9%
Software Engineering Institute11
 
1.5%
PNNL10
 
1.3%
Liberty Mutual Insurance10
 
1.3%
AstraZeneca9
 
1.2%
MITRE8
 
1.1%
Numeric, LLC7
 
0.9%
Advanced BioScience Laboratories7
 
0.9%
Other values (333)638
86.0%

Length

2022-10-02T10:04:15.559049image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
health36
 
2.4%
solutions22
 
1.5%
insurance22
 
1.5%
pharmaceuticals21
 
1.4%
the21
 
1.4%
inc21
 
1.4%
llc18
 
1.2%
of18
 
1.2%
institute16
 
1.1%
group15
 
1.0%
Other values (508)1299
86.1%

Most occurring characters

ValueCountFrequency (%)
e1019
 
9.0%
a891
 
7.9%
767
 
6.8%
n688
 
6.1%
t685
 
6.1%
i677
 
6.0%
r657
 
5.8%
o624
 
5.5%
s595
 
5.3%
c408
 
3.6%
Other values (60)4286
37.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter8423
74.6%
Uppercase Letter1950
 
17.3%
Space Separator767
 
6.8%
Other Punctuation81
 
0.7%
Decimal Number54
 
0.5%
Dash Punctuation18
 
0.2%
Math Symbol4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e1019
12.1%
a891
10.6%
n688
 
8.2%
t685
 
8.1%
i677
 
8.0%
r657
 
7.8%
o624
 
7.4%
s595
 
7.1%
c408
 
4.8%
l408
 
4.8%
Other values (16)1771
21.0%
Uppercase Letter
ValueCountFrequency (%)
C192
 
9.8%
S178
 
9.1%
T151
 
7.7%
A148
 
7.6%
I135
 
6.9%
L122
 
6.3%
M117
 
6.0%
P111
 
5.7%
R109
 
5.6%
E94
 
4.8%
Other values (16)593
30.4%
Decimal Number
ValueCountFrequency (%)
220
37.0%
010
18.5%
37
 
13.0%
46
 
11.1%
15
 
9.3%
92
 
3.7%
62
 
3.7%
81
 
1.9%
71
 
1.9%
Other Punctuation
ValueCountFrequency (%)
.37
45.7%
,21
25.9%
&13
 
16.0%
'8
 
9.9%
/2
 
2.5%
Math Symbol
ValueCountFrequency (%)
<2
50.0%
>2
50.0%
Space Separator
ValueCountFrequency (%)
767
100.0%
Dash Punctuation
ValueCountFrequency (%)
-18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin10373
91.8%
Common924
 
8.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e1019
 
9.8%
a891
 
8.6%
n688
 
6.6%
t685
 
6.6%
i677
 
6.5%
r657
 
6.3%
o624
 
6.0%
s595
 
5.7%
c408
 
3.9%
l408
 
3.9%
Other values (42)3721
35.9%
Common
ValueCountFrequency (%)
767
83.0%
.37
 
4.0%
,21
 
2.3%
220
 
2.2%
-18
 
1.9%
&13
 
1.4%
010
 
1.1%
'8
 
0.9%
37
 
0.8%
46
 
0.6%
Other values (8)17
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII11297
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e1019
 
9.0%
a891
 
7.9%
767
 
6.8%
n688
 
6.1%
t685
 
6.1%
i677
 
6.0%
r657
 
5.8%
o624
 
5.5%
s595
 
5.3%
c408
 
3.6%
Other values (60)4286
37.9%

Job Location
Categorical

HIGH CORRELATION

Distinct37
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
CA
152 
MA
103 
NY
72 
VA
41 
IL
40 
Other values (32)
334 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters1484
Distinct characters24
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st rowNM
2nd rowMD
3rd rowFL
4th rowWA
5th rowNY

Common Values

ValueCountFrequency (%)
CA152
20.5%
MA103
13.9%
NY72
 
9.7%
VA41
 
5.5%
IL40
 
5.4%
MD35
 
4.7%
PA33
 
4.4%
TX28
 
3.8%
NC21
 
2.8%
WA21
 
2.8%
Other values (27)196
26.4%

Length

2022-10-02T10:04:15.824343image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ca152
20.5%
ma103
13.9%
ny72
 
9.7%
va41
 
5.5%
il40
 
5.4%
md35
 
4.7%
pa33
 
4.4%
tx28
 
3.8%
wa21
 
2.8%
nc21
 
2.8%
Other values (27)196
26.4%

Most occurring characters

ValueCountFrequency (%)
A382
25.7%
C201
13.5%
M158
10.6%
N142
 
9.6%
Y78
 
5.3%
I74
 
5.0%
L68
 
4.6%
T56
 
3.8%
D54
 
3.6%
V41
 
2.8%
Other values (14)230
15.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter1484
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A382
25.7%
C201
13.5%
M158
10.6%
N142
 
9.6%
Y78
 
5.3%
I74
 
5.0%
L68
 
4.6%
T56
 
3.8%
D54
 
3.6%
V41
 
2.8%
Other values (14)230
15.5%

Most occurring scripts

ValueCountFrequency (%)
Latin1484
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A382
25.7%
C201
13.5%
M158
10.6%
N142
 
9.6%
Y78
 
5.3%
I74
 
5.0%
L68
 
4.6%
T56
 
3.8%
D54
 
3.6%
V41
 
2.8%
Other values (14)230
15.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1484
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A382
25.7%
C201
13.5%
M158
10.6%
N142
 
9.6%
Y78
 
5.3%
I74
 
5.0%
L68
 
4.6%
T56
 
3.8%
D54
 
3.6%
V41
 
2.8%
Other values (14)230
15.5%

Age
Real number (ℝ)

HIGH CORRELATION

Distinct102
Distinct (%)13.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.52425876
Minimum-1
Maximum277
Zeros0
Zeros (%)0.0%
Negative50
Negative (%)6.7%
Memory size5.9 KiB
2022-10-02T10:04:16.061281image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile-1
Q112
median25
Q360
95-th percentile170
Maximum277
Range278
Interquartile range (IQR)48

Descriptive statistics

Standard deviation53.83907976
Coefficient of variation (CV)1.132875739
Kurtosis2.777736511
Mean47.52425876
Median Absolute Deviation (MAD)17
Skewness1.781382155
Sum35263
Variance2898.646509
MonotonicityNot monotonic
2022-10-02T10:04:16.622370image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-150
 
6.7%
1132
 
4.3%
1331
 
4.2%
2527
 
3.6%
1524
 
3.2%
921
 
2.8%
1019
 
2.6%
6318
 
2.4%
1418
 
2.4%
3718
 
2.4%
Other values (92)484
65.2%
ValueCountFrequency (%)
-150
6.7%
22
 
0.3%
412
 
1.6%
55
 
0.7%
616
 
2.2%
713
 
1.8%
815
 
2.0%
921
2.8%
1019
 
2.6%
1132
4.3%
ValueCountFrequency (%)
2771
 
0.1%
24014
1.9%
2091
 
0.1%
1914
 
0.5%
1752
 
0.3%
1727
0.9%
1711
 
0.1%
17014
1.9%
1695
 
0.7%
1652
 
0.3%

Python
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
1
392 
0
350 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1392
52.8%
0350
47.2%

Length

2022-10-02T10:04:16.927270image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:17.185719image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
1392
52.8%
0350
47.2%

Most occurring characters

ValueCountFrequency (%)
1392
52.8%
0350
47.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1392
52.8%
0350
47.2%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1392
52.8%
0350
47.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1392
52.8%
0350
47.2%

spark
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
575 
1
167 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0575
77.5%
1167
 
22.5%

Length

2022-10-02T10:04:17.392846image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:17.670141image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0575
77.5%
1167
 
22.5%

Most occurring characters

ValueCountFrequency (%)
0575
77.5%
1167
 
22.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0575
77.5%
1167
 
22.5%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0575
77.5%
1167
 
22.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0575
77.5%
1167
 
22.5%

aws
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
566 
1
176 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0566
76.3%
1176
 
23.7%

Length

2022-10-02T10:04:17.870548image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:18.101598image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0566
76.3%
1176
 
23.7%

Most occurring characters

ValueCountFrequency (%)
0566
76.3%
1176
 
23.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0566
76.3%
1176
 
23.7%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0566
76.3%
1176
 
23.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0566
76.3%
1176
 
23.7%

excel
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
1
388 
0
354 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
1388
52.3%
0354
47.7%

Length

2022-10-02T10:04:18.308327image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:18.524815image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
1388
52.3%
0354
47.7%

Most occurring characters

ValueCountFrequency (%)
1388
52.3%
0354
47.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1388
52.3%
0354
47.7%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1388
52.3%
0354
47.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1388
52.3%
0354
47.7%

sql
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
1
380 
0
362 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
1380
51.2%
0362
48.8%

Length

2022-10-02T10:04:18.729468image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:18.955764image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
1380
51.2%
0362
48.8%

Most occurring characters

ValueCountFrequency (%)
1380
51.2%
0362
48.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1380
51.2%
0362
48.8%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1380
51.2%
0362
48.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1380
51.2%
0362
48.8%

sas
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
676 
1
 
66

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
0676
91.1%
166
 
8.9%

Length

2022-10-02T10:04:19.154267image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:19.429342image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0676
91.1%
166
 
8.9%

Most occurring characters

ValueCountFrequency (%)
0676
91.1%
166
 
8.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0676
91.1%
166
 
8.9%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0676
91.1%
166
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0676
91.1%
166
 
8.9%

keras
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
713 
1
 
29

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0713
96.1%
129
 
3.9%

Length

2022-10-02T10:04:19.672275image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:20.052021image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0713
96.1%
129
 
3.9%

Most occurring characters

ValueCountFrequency (%)
0713
96.1%
129
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0713
96.1%
129
 
3.9%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0713
96.1%
129
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0713
96.1%
129
 
3.9%

pytorch
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
703 
1
 
39

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0703
94.7%
139
 
5.3%

Length

2022-10-02T10:04:20.324071image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:20.593904image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0703
94.7%
139
 
5.3%

Most occurring characters

ValueCountFrequency (%)
0703
94.7%
139
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0703
94.7%
139
 
5.3%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0703
94.7%
139
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0703
94.7%
139
 
5.3%

scikit
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
688 
1
 
54

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0688
92.7%
154
 
7.3%

Length

2022-10-02T10:04:20.824752image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:21.043479image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0688
92.7%
154
 
7.3%

Most occurring characters

ValueCountFrequency (%)
0688
92.7%
154
 
7.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0688
92.7%
154
 
7.3%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0688
92.7%
154
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0688
92.7%
154
 
7.3%

tensor
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
670 
1
72 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0670
90.3%
172
 
9.7%

Length

2022-10-02T10:04:21.246145image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:21.531633image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0670
90.3%
172
 
9.7%

Most occurring characters

ValueCountFrequency (%)
0670
90.3%
172
 
9.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0670
90.3%
172
 
9.7%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0670
90.3%
172
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0670
90.3%
172
 
9.7%

hadoop
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
618 
1
124 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0618
83.3%
1124
 
16.7%

Length

2022-10-02T10:04:21.733410image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:21.963793image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0618
83.3%
1124
 
16.7%

Most occurring characters

ValueCountFrequency (%)
0618
83.3%
1124
 
16.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0618
83.3%
1124
 
16.7%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0618
83.3%
1124
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0618
83.3%
1124
 
16.7%

tableau
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
594 
1
148 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0594
80.1%
1148
 
19.9%

Length

2022-10-02T10:04:22.153872image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:22.378787image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0594
80.1%
1148
 
19.9%

Most occurring characters

ValueCountFrequency (%)
0594
80.1%
1148
 
19.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0594
80.1%
1148
 
19.9%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0594
80.1%
1148
 
19.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0594
80.1%
1148
 
19.9%

bi
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
686 
1
 
56

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0686
92.5%
156
 
7.5%

Length

2022-10-02T10:04:22.587946image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:22.814309image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0686
92.5%
156
 
7.5%

Most occurring characters

ValueCountFrequency (%)
0686
92.5%
156
 
7.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0686
92.5%
156
 
7.5%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0686
92.5%
156
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0686
92.5%
156
 
7.5%

flink
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
732 
1
 
10

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0732
98.7%
110
 
1.3%

Length

2022-10-02T10:04:23.004799image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:23.229198image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0732
98.7%
110
 
1.3%

Most occurring characters

ValueCountFrequency (%)
0732
98.7%
110
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0732
98.7%
110
 
1.3%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0732
98.7%
110
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0732
98.7%
110
 
1.3%

mongo
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
705 
1
 
37

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0705
95.0%
137
 
5.0%

Length

2022-10-02T10:04:23.414086image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:23.652157image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0705
95.0%
137
 
5.0%

Most occurring characters

ValueCountFrequency (%)
0705
95.0%
137
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0705
95.0%
137
 
5.0%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0705
95.0%
137
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0705
95.0%
137
 
5.0%

google_an
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
0
728 
1
 
14

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters742
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0728
98.1%
114
 
1.9%

Length

2022-10-02T10:04:23.840929image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:24.060927image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0728
98.1%
114
 
1.9%

Most occurring characters

ValueCountFrequency (%)
0728
98.1%
114
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number742
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0728
98.1%
114
 
1.9%

Most occurring scripts

ValueCountFrequency (%)
Common742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0728
98.1%
114
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0728
98.1%
114
 
1.9%

job_title_sim
Categorical

HIGH CORRELATION

Distinct10
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
data scientist
313 
other scientist
143 
data engineer
119 
analyst
101 
machine learning engineer
 
22
Other values (5)
44 

Length

Max length30
Median length25
Mean length13.53504043
Min length2

Characters and Unicode

Total characters10043
Distinct characters19
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowdata scientist
2nd rowdata scientist
3rd rowdata scientist
4th rowdata scientist
5th rowdata scientist

Common Values

ValueCountFrequency (%)
data scientist313
42.2%
other scientist143
19.3%
data engineer119
 
16.0%
analyst101
 
13.6%
machine learning engineer22
 
3.0%
Data scientist project manager16
 
2.2%
na10
 
1.3%
data analitics8
 
1.1%
data modeler5
 
0.7%
director5
 
0.7%

Length

2022-10-02T10:04:24.265705image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:24.558975image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
scientist472
33.2%
data461
32.4%
other143
 
10.1%
engineer141
 
9.9%
analyst101
 
7.1%
machine22
 
1.5%
learning22
 
1.5%
project16
 
1.1%
manager16
 
1.1%
na10
 
0.7%
Other values (3)18
 
1.3%

Most occurring characters

ValueCountFrequency (%)
t1678
16.7%
a1226
12.2%
i1150
11.5%
e1129
11.2%
s1053
10.5%
n955
9.5%
680
6.8%
c523
 
5.2%
d455
 
4.5%
r353
 
3.5%
Other values (9)841
8.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter9347
93.1%
Space Separator680
 
6.8%
Uppercase Letter16
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t1678
18.0%
a1226
13.1%
i1150
12.3%
e1129
12.1%
s1053
11.3%
n955
10.2%
c523
 
5.6%
d455
 
4.9%
r353
 
3.8%
g179
 
1.9%
Other values (7)646
 
6.9%
Space Separator
ValueCountFrequency (%)
680
100.0%
Uppercase Letter
ValueCountFrequency (%)
D16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin9363
93.2%
Common680
 
6.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
t1678
17.9%
a1226
13.1%
i1150
12.3%
e1129
12.1%
s1053
11.2%
n955
10.2%
c523
 
5.6%
d455
 
4.9%
r353
 
3.8%
g179
 
1.9%
Other values (8)662
 
7.1%
Common
ValueCountFrequency (%)
680
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII10043
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t1678
16.7%
a1226
12.2%
i1150
11.5%
e1129
11.2%
s1053
10.5%
n955
9.5%
680
6.8%
c523
 
5.2%
d455
 
4.5%
r353
 
3.5%
Other values (9)841
8.4%
Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
na
519 
sr
220 
jr
 
3

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters1484
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowna
2nd rowna
3rd rowna
4th rowna
5th rowna

Common Values

ValueCountFrequency (%)
na519
69.9%
sr220
29.6%
jr3
 
0.4%

Length

2022-10-02T10:04:24.893441image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:25.128791image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
na519
69.9%
sr220
29.6%
jr3
 
0.4%

Most occurring characters

ValueCountFrequency (%)
n519
35.0%
a519
35.0%
r223
15.0%
s220
14.8%
j3
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1484
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n519
35.0%
a519
35.0%
r223
15.0%
s220
14.8%
j3
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Latin1484
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
n519
35.0%
a519
35.0%
r223
15.0%
s220
14.8%
j3
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII1484
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n519
35.0%
a519
35.0%
r223
15.0%
s220
14.8%
j3
 
0.2%

Degree
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
na
383 
M
252 
P
107 

Length

Max length2
Median length2
Mean length1.516172507
Min length1

Characters and Unicode

Total characters1125
Distinct characters4
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowM
3rd rowM
4th rowna
5th rowna

Common Values

ValueCountFrequency (%)
na383
51.6%
M252
34.0%
P107
 
14.4%

Length

2022-10-02T10:04:25.381960image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-10-02T10:04:25.700928image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
na383
51.6%
m252
34.0%
p107
 
14.4%

Most occurring characters

ValueCountFrequency (%)
n383
34.0%
a383
34.0%
M252
22.4%
P107
 
9.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter766
68.1%
Uppercase Letter359
31.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n383
50.0%
a383
50.0%
Uppercase Letter
ValueCountFrequency (%)
M252
70.2%
P107
29.8%

Most occurring scripts

ValueCountFrequency (%)
Latin1125
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
n383
34.0%
a383
34.0%
M252
22.4%
P107
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1125
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n383
34.0%
a383
34.0%
M252
22.4%
P107
 
9.5%

Interactions

2022-10-02T10:04:01.365588image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:49.392226image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:51.477319image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:53.431044image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:55.438905image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:57.583768image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:59.478040image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:01.623389image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:49.724615image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:51.734958image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:53.713290image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:55.710863image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:57.873309image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:59.724965image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:01.923760image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:50.076438image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:52.035550image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:54.014664image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:55.986230image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:58.129683image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:59.983627image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:02.209889image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:50.384004image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:52.313500image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:54.320980image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:56.288844image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:58.419301image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:00.287926image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:02.482514image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:50.658970image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:52.608681image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:54.619438image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:56.593330image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:58.692463image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:00.551829image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:02.748259image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:50.933685image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:52.867398image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:54.879478image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:57.035398image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:58.946093image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:00.818386image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:03.008200image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:51.191322image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:53.149587image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:55.165911image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:57.303718image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:03:59.205502image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-10-02T10:04:01.087265image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Correlations

2022-10-02T10:04:26.038648image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2022-10-02T10:04:26.659431image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2022-10-02T10:04:27.254900image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2022-10-02T10:04:27.863686image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.
2022-10-02T10:04:28.475823image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2022-10-02T10:04:03.608879image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
A simple visualization of nullity by column.
2022-10-02T10:04:06.008956image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

df_indexJob TitleSalary EstimateJob DescriptionRatingCompany NameLocationHeadquartersSizeFoundedType of ownershipIndustrySectorRevenueCompetitorsHourlyEmployer providedLower SalaryUpper SalaryAvg Salary(K)company_txtJob LocationAgePythonsparkawsexcelsqlsaskeraspytorchscikittensorhadooptableaubiflinkmongogoogle_anjob_title_simseniority_by_titleDegree
00Data Scientist$53K-$91K (Glassdoor est.)Data Scientist\nLocation: Albuquerque, NM\nEducation Required: Bachelor’s degree required, preferably in math, engineering, business, or the sciences.\nSkills Required:\nBachelor’s Degree in relevant field, e.g., math, data analysis, database, computer science, Artificial Intelligence (AI); three years’ experience credit for Master’s degree; five years’ experience credit for a Ph.D\nApplicant should be proficient in the use of Power BI, Tableau, Python, MATLAB, Microsoft Word, PowerPoint, Excel, and working knowledge of MS Access, LMS, SAS, data visualization tools, and have a strong algorithmic aptitude\nExcellent verbal and written communication skills, and quantitative analytical skills are required\nApplicant must be able to work in a team environment\nU.S. citizenship and ability to obtain a DoD Secret Clearance required\nResponsibilities: The applicant will be responsible for formulating analytical solutions to complex data problems; creating data analytic models to improve data metrics; analyzing customer behavior and trends; delivering insights to stakeholders, as well as designing and crafting reports, dashboards, models, and algorithms to make data insights actionable; selecting features, building and optimizing classifiers using machine learning techniques; data mining using state-of-the-art methods, extending organization’s data with third party sources of information when needed; enhancing data collection procedures to include information that is relevant for building analytic systems; processing, cleansing, and verifying the integrity of data used for analysis; doing ad-hoc analysis and presenting results in a clear manner; and creating automated anomaly detection systems and constant tracking of its performance.\nBenefits:\nWe offer competitive salaries commensurate with education and experience. We have an excellent benefits package that includes:\nComprehensive health, dental, life, long and short term disability insurance\n100% Company funded Retirement Plans\nGenerous vacation, holiday and sick pay plans\nTuition assistance\n\nBenefits are provided to employees regularly working a minimum of 30 hours per week.\n\nTecolote Research is a private, employee-owned corporation where people are our primary resource. Our investments in technology and training give our employees the tools to ensure our clients are provided the solutions they need, and our very high employee retention rate and stable workforce is an added value to our customers. Apply now to connect with a company that invests in you.3.8Tecolote Research\n3.8Albuquerque, NMGoleta, CA501 - 10001973Company - PrivateAerospace & DefenseAerospace & Defense$50 to $100 million (USD)-100539172.0Tecolote ResearchNM481001010000011000data scientistnaM
11Healthcare Data Scientist$63K-$112K (Glassdoor est.)What You Will Do:\n\nI. General Summary\n\nThe Healthcare Data Scientist position will join our Advanced Analytics group at the University of Maryland Medical System (UMMS) in support of its strategic priority to become a data-driven and outcomes-oriented organization. The successful candidate will have 3+ years of experience with Machine Learning, Predictive Modeling, Statistical Analysis, Mathematical Optimization, Algorithm Development and a passion for working with healthcare data. Previous experience with various computational approaches along with an ability to demonstrate a portfolio of relevant prior projects is essential. This position will report to the UMMS Vice President for Enterprise Data and Analytics (ED&A).\n\nII. Principal Responsibilities and Tasks\n\n• Develops predictive and prescriptive analytic models in support of the organization’s clinical, operations and business initiatives and priorities.\n• Deploys solutions so that they provide actionable insights to the organization and are embedded or integrated with application systems\n• Supports and drives analytic efforts designed around organization’s strategic priorities and clinical/business problems\n• Works in a team to drive disruptive innovation, which may translate into improved quality of care, clinical outcomes, reduced costs, temporal efficiencies and process improvements.\n• Builds and extends our analytics portfolio supported by robust documentation\n• Works with autonomy to find solutions to complex problems using open source tools and in-house development\n• Stays abreast of state-of-the-art literature in the fields of operations research, statistical modeling, statistical process control and mathematical optimization\n• Creates, communicates, and manages the project plans and other required project documentation and provides updates to leadership as necessary\n• Develops and maintains relationships with business, IT and clinical leaders and stakeholders across the enterprise to facilitate collaboration and effective communication\n• Works with the analytics team and clinical/business stakeholders to develop pilots so that they may be tested and validated in pilot settings\n• Performs analysis to evaluate primary and secondary objectives from such pilots\n• Assists leadership with strategies for scaling successful projects across the organization and enhances the analytics applications based on feedback from end-users and clinical/business consumers\n• Assists leadership with dissemination of success stories (and failures) in an effort to increase analytics literacy and adoption across the organization.\n\nWhat You Need to Be Successful:\n\nIII. Education and Experience\n\n• Master’s or higher degree (may be substituted by relevant work experience) in applied mathematics, physics, computer science, engineering, statistics or a related field\n• 3+ years of Mathematical Optimization, Machine Learning, Predictive Analytics and Algorithm Development experience (experience with tools such as WEKA, RapidMiner, R. Python or other open source tools strongly desired)\n• Strong development skills in two or more of the following: C/C++, C#, Python, Java\n• Combining analytic methods with advanced data visualizations\n• Expert ability to breakdown and clearly define problems\n• Experience with Natural Language Processing preferred\n\nIV. Knowledge, Skills and Abilities\n\n• Proven communications skills – Effective at working independently and in collaboration with other staff members. Capable of clearly presenting findings orally, in writing, or through graphics.\n• Proven analytical skills – Able to compare, contrast, and validate work with keen attention to detail. Skilled in working with “real world” data including scrubbing, transformation, and imputation.\n• Proven problem solving skills – Able to plan work, set clear direction, and coordinate own tasks in a fast-paced multidisciplinary environment. Expert at triaging issues, identifying data anomalies, and debugging software.\n• Design and prototype new application functionality for our products.\n• Change oriented – actively generates process improvements; supports and drives change, and confronts difficult circumstances in creative ways\n• Effective communicator and change agent\n• Ability to prioritize the tasks of the project timeline to achieve the desired results\n• Strong analytic and problem solving skills\n• Ability to cooperatively and effectively work with people from various organization levels\n\nWe are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.3.4University of Maryland Medical System\n3.4Linthicum, MDBaltimore, MD10000+1984Other OrganizationHealth Care Services & HospitalsHealth Care$2 to $5 billion (USD)-1006311287.5University of Maryland Medical SystemMD371000000000000000data scientistnaM
22Data Scientist$80K-$90K (Glassdoor est.)KnowBe4, Inc. is a high growth information security company. We are the world's largest provider of new-school security awareness training and simulated phishing. KnowBe4 was created to help organizations manage the ongoing problem of social engineering. Tens of thousands of organizations worldwide use KnowBe4's platform to mobilize their end users as a last line of defense and enable them to make better security decisions, every day.\n\nWe are ranked #1 best place to work in technology nationwide by Fortune Magazine and have placed #1 or #2 in The Tampa Bay Top Workplaces Survey for the last four years. We also just had our 27th record-setting quarter in a row!\n\nThe Data Scientist will work closely with the VP of FP&A and the Quantitative Analytics Manager to implement advanced analytical models and other data-driven solutions.\n\nResponsibilities:\nWork with key stakeholders throughout the organization to identify opportunities using financial data to develop business solutions.\nDevelop new and enhance existing data collection procedures to ensure that all data relevant for analyses is captured.\nCleanse, consolidate, and verify the integrity of data used in analyses.\nBuild and validate predictive models to increase customer retention, revenue generation, and other business outcomes.\nDevelop relevant statistical models to assist with profitability forecasting\nCreate the analytics to leverage known, inferred and appended information about origins and recognizing patterns to assist in outlook forecasting\nAssist in the design and data modeling of data warehouse.\nVisualize data, especially in reports and dashboards, to communicate analysis results to stakeholders.\nExtend data collection to unstructured data within the company and external sources\nMine and collect data (both structured and unstructured) to detect patterns, opportunities and insights that drive our organization\nCreate and execute automation and data mining requests utilizing SQL, Access, Excel, SAS and other statistical programs\nTrouble shoot forecast and optimization anomalies with FP&A team through the use of statistical and mathematical optimization models. Develop testing to explain and or reduce these anomalies.\nOversee and develop key metric forecasts as well as provide budget support based on trends in the business/industry.\nMinimum Qualifications:\nMaster's degree in Statistics, Computer Science, Mathematics or other quantitative discipline required\n2-3 years of experience in similar role (Master's Degree)\n0-2 years of experience in similar role (PhD)\nExperience leveraging predictive modeling, big data analytics, exploratory data analysis and machine learning to drive significant business impact\nExperience with statistical computer languages (Python, R etc.) to manipulate and analyze large datasets preferred.\nExperience with data visualization tools like D3.js, matplotlib, etc., preferred\nExceptional understanding of machine learning algorithms such as Random Forest, SVM, k-NN, Naïve Bayes, Gradient Boosting a plus.\nApplied statistical skills including statistical testing, regression, etc.\nExperience with data bases, query languages, and associated data architecture.\nExperience with distributed computing tools (Hive, Spark, etc.) is a plus.\nStrong analytical skills and ability to meet project deadlines.\nNote: An applicant assessment, background check and drug test may be part of your hiring procedure.\n\nNo recruitment agencies, please.4.8KnowBe4\n4.8Clearwater, FLClearwater, FL501 - 10002010Company - PrivateSecurity ServicesBusiness Services$100 to $500 million (USD)-100809085.0KnowBe4FL111101110000000000data scientistnaM
33Data Scientist$56K-$97K (Glassdoor est.)*Organization and Job ID**\nJob ID: 310709\n\nDirectorate: Earth & Biological Sciences\n\nDivision: Biological Sciences\n\nGroup: Exposure Science Team\n*Job Description**\nThe Biological System Science (BSS) Group in the Biological Sciences Division of the Pacific Northwest National Laboratory (PNNL) is seeking a staff scientist with multidisciplinary experience in computational chemistry, cheminformatics, advanced statistics and/or machine learning/deep learning/AI. Preferred candidates will have a broad understanding of the state of computational metabolomics and experience in designing and implementing novel deep learning networks for chemistry applications. Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification is also highly valued. Successful candidates will join a large, uniquely collaborative, collegial group of innovators driving the integration of data science, computational science and analytical chemistry to solve the nations most challenging problems in human health, chemical forensics, and national security. The BSS Group is diverse and inclusive, working closely with colleagues across the laboratory with expertise in computational biology, integrative omics, applied mathematics, computer science, and statistics.\n\n+ Apply knowledge of statistics, machine learning, advanced mathematics, simulation, software development, and data modeling to to design, development and implement methods that integrate, clean and analyze data, recognize patterns, address uncertainty, pose questions, and make discoveries from structured and/or unstructured data.\n\n+ Produce solutions driven by exploratory data analysis from complex and high-dimensional datasets.\n\n+ Design, develop, and evaluate predictive models and advanced algorithms that lead to optimal value extraction from data.\n\n+ Develop and maintain existing deep learning networks that generate novel molecules for drug discovery applications\n\n+ Contribue or author proposals, peer-reviewed papers, and other technical products.\n*Minimum Qualifications**\nBS/BA with 0-1 years of experience or MS/MA with 0-1 years of experience\n*Preferred Qualifications**\n+ MS in chemical engineering, computer science, or related field with a GPA of 3.5+ 5+ years of research experience\n\n+ Intermediate level programming experience (preferably Python) and high-performance computing experience\n\n+ At least one first author published, or proof of submitted, paper applying deep learning for use in novel compound generation\n\n+ Understanding of the NMDA receptor and potential drug targets\n\n+ Research experience in drug design, cheminformatics, deep learning, machine learning and/or small molecule identification\n*Equal Employment Opportunity**\nBattelle Memorial Institute (BMI) at Pacific Northwest National Laboratory (PNNL) is an Affirmative Action/Equal Opportunity Employer and supports diversity in the workplace. All employment decisions are made without regard to race, color, religion, sex, national origin, age, disability, veteran status, marital or family status, sexual orientation, gender identity, or genetic information. All BMI staff must be able to demonstrate the legal right to work in the United States. BMI is an E-Verify employer. Learn more at jobs.pnnl.gov.\n*_Please be aware that the Department of Energy (DOE) prohibits DOE employees and contractors from participation in certain foreign government talent recruitment programs. If you are offered a position at PNNL and are currently a participant in a foreign government talent recruitment program you will be required to disclose this information before your first day of employment._**\n_Directorate:_ _Earth & Biological Sciences_\n\n_Job Category:_ _Scientists/Scientific Support_\n\n_Group:_ _Biological Systems Science_\n\n_Opening Date:_ _2020-03-26_\n\n_Closing Date:_ _2020-04-05_3.8PNNL\n3.8Richland, WARichland, WA1001 - 50001965GovernmentEnergyOil, Gas, Energy & Utilities$500 million to $1 billion (USD)Oak Ridge National Laboratory, National Renewable Energy Lab, Los Alamos National Laboratory00569776.5PNNLWA561000000000000000data scientistnana
44Data Scientist$86K-$143K (Glassdoor est.)Data Scientist\nAffinity Solutions / Marketing Cloud seeks smart, curious, technically savvy candidates to join our cutting-edge data science team. We hire the best and brightest and give them the opportunity to work on industry-leading technologies.\nThe data sciences team at AFS/Marketing Cloud build models, machine learning algorithms that power all our ad-tech/mar-tech products at scale, develop methodology and tools to precisely and effectively measure market campaign effects, and research in-house and public data sources for consumer spend behavior insights. In this role, you'll have the opportunity to come up with new ideas and solutions that will lead to improvement of our ability to target the right audience, derive insights and provide better measurement methodology for marketing campaigns. You'll access our core data asset and machine learning infrastructure to power your ideas.\nDuties and Responsibilities\n· Support all clients model building needs, including maintaining and improving current modeling/scoring methodology and processes,\n· Provide innovative solutions to customized modeling/scoring/targeting with appropriate ML/statistical tools,\n· Provide analytical/statistical support such as marketing test design, projection, campaign measurement, market insights to clients and stakeholders.\n· Mine large consumer datasets in the cloud environment to support ad hoc business and statistical analysis,\n· Develop and Improve automation capabilities to enable customized delivery of the analytical products to clients,\n· Communicate the methodologies and the results to the management, clients and none technical stakeholders.\nBasic Qualifications\n· Advanced degree in Statistics/Mathematics/Computer Science/Economics or other fields that requires advanced training in data analytics.\n· Being able to apply basic statistical/ML concepts and reasoning to address and solve business problems such as targeting, test design, KPI projection and performance measurement.\n· Entrepreneurial, highly self-motivated, collaborative, keen attention to detail, willingness and capable learn quickly, and ability to effectively prioritize and execute tasks in a high pressure environment.\n· Being flexible to accept different task assignments and able to work on a tight time schedule.\n· Excellent command of one or more programming languages; preferably Python, SAS or R\n· Familiar with one of the database technologies such as PostgreSQL, MySQL, can write basic SQL queries\n· Great communication skills (verbal, written and presentation)\nPreferred Qualifications\n· Experience or exposure to large consumer and/or demographic data sets.\n· Familiarity with data manipulation and cleaning routines and techniques.2.9Affinity Solutions\n2.9New York, NYNew York, NY51 - 2001998Company - PrivateAdvertising & MarketingBusiness ServicesUnknown / Non-ApplicableCommerce Signals, Cardlytics, Yodlee0086143114.5Affinity SolutionsNY231001110000000000data scientistnana
55Data Scientist$71K-$119K (Glassdoor est.)CyrusOne is seeking a talented Data Scientist who holds a range of data-focused skills both in technical and analytical domains. The ideal candidate is adept at processing, cleansing, and verifying the integrity of data used for visualization and analysis. This role is dynamic, granting the candidate the opportunity to participate in a wide variety of projects and collaborate with many cross-functional teams throughout the business.\n\nDuties and Responsibilities:\nParticipate in an agile scrum cadence\nProcess, cleanse, and verify the integrity of data used for analysis\nPerform functional business requirements analysis and data analysis\nDevelop data models and algorithms to apply to data sets\nAugment data collection procedures to include necessary information for building accurate analytics\nCollaborate with stakeholders throughout the organization to identify opportunities for leveraging data to drive business solutions\nEvaluate the effectiveness and accuracy of data sources and data gathering techniques\nGather critical information from meetings with various stakeholders and produce useful reports\nCoordinate with cross-functional teams to implement models and monitor outcomes\nDevelop automated discrepancy detection systems and distribute reconciliation reports to stakeholders\nRequirements:\nMust be legally authorized to work in the United States for any employer without sponsorship\nProfessional experience using statistical software languages like R, Python, and SQL to query, manipulate, and draw insights from data sets\nStrong problem-solving skills with an emphasis on product development\nExtensive experience with Microsoft SQL, MySQL and MongoDB\nUnderstanding of version control (git) and project management with Azure DevOps\nKnowledge of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.)\nExperience visualizing data for stakeholders using visualization tools such as Power BI\nExperience working with and creating data architectures\nUnderstanding and adherence to agile principles and practices\nAbility to work on problems of any scope where the analysis of situations or data requires a review of a variety of factors\nSelf-maintainability and reliability with minimal supervision\nExcellent interpersonal communication, decision making, presentation, and organizational skills\nAbility to build productive internal/external working relationships\nHarmonious with CyrusOne culture, core values, and business goals\nMinimum Qualifications:\n2+ years of related experience in a data analyst role\nStrong can-do attitude in a time sensitive environment\nOther important information about this position:\nThis position requires typical weekday (Monday - Friday) attendance in an office setting, at times after hours work may be required to meet business and customer needs\nEvery position requires certain physical capabilities. CyrusOne seeks to make reasonable accommodations that enable individuals with disabilities to perform essential duties when possible\nCyrusOne is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status.\n\nCyrusOne provides reasonable accommodation for qualified individuals with disabilities in accordance with the Americans with Disabilities Act (ADA) and any other state or local laws. We will respond to requests for reasonable accommodations to assist you in applying for positions at CyrusOne, or to submit a resume. If you need to request an accommodation, please contact our Human Resources at 214.488.1365 (Option 7) or by email at HR@cyrusone.com.3.4CyrusOne\n3.4Dallas, TXDallas, TX201 - 5002000Company - PublicReal EstateReal Estate$1 to $2 billion (USD)Digital Realty, CoreSite, Equinix007111995.0CyrusOneTX211011100000001010data scientistnana
66Data Scientist$54K-$93K (Glassdoor est.)Job Description\n\n**Please only local candidates apply - thank you**\n\nClearOne Advantage is a fast-growing company that is aggressively hiring due to increased business. We are always improving our marketing, culture and technology to provide our employees with the best work atmosphere and our customers with excellent customer service. COA’s proprietary software is tailored to our industry and allows the client to receive the best service possible.\n\nWe are looking for a Data Scientist to analyze large amounts of raw information to find patterns that will help improve our company. We will rely on you to build data products to extract valuable business insights. In this role, you should be highly analytical with a knack for analysis, math and statistics. Critical thinking and problem-solving skills are essential for interpreting data. We want to see a passion for machine-learning and research. Your goal will be to help our company analyze trends to make better decisions.\n\nIf you are looking to work in a team environment, a place where you are more a name than a number, where you interact with leadership daily, then please send your resume for review!\n\nPerks:\nGreat location, right on the water in the Canton Crossing Tower\nCasual work environment and WFH flexibility\nRoom for advancement\nWhat you'll be doing:\nIdentify valuable data sources and automate collection processes\nUndertake preprocessing of structured and unstructured data\nAnalyze large amounts of information to discover trends and patterns\nBuild predictive models and machine-learning algorithms\nCombine models through ensemble modeling\nPresent information using data visualization techniques\nPropose solutions and strategies to business challenges\nCollaborate with engineering and product development teams4.1ClearOne Advantage\n4.1Baltimore, MDBaltimore, MD501 - 10002008Company - PrivateBanks & Credit UnionsFinanceUnknown / Non-Applicable-100549373.5ClearOne AdvantageMD130001000000000000data scientistnana
77Data Scientist$86K-$142K (Glassdoor est.)Advanced Analytics – Lead Data Scientist\nOverview\n\n\nWe are looking for a Data Scientist to join our Data Science team to work on interesting projects to help our clients make data driven solutions. As a Data Scientist, you’ll work closely with the clients to understand their business needs, frame them as statistical problems, and solve them with cutting edge techniques. Collaborate with your team, including machine learning engineers, data engineers, analysts, and TPMs to define tasks, provide estimates, and work together to deliver a world class solution. The ideal candidate will have the balance of technical skills and business acumen to help the client better understand their core needs while understanding technical limitations.\n\nAbout you…\nExperience partnering & communicating with executive management team to understand business needs and pain points\nAbility to communicate data science concepts to business stakeholders\nPassion for the application of machine learning to real world problems\nAdept at developing and iterating solutions rapidly\nAbility to lead development of data science solutions\nWhat we offer our consultants:\n\nExperience working with both large enterprise clients and mid-sized clients\nProgressive responsibilities that encourage ownership and design\nOpportunity to learn and gain experience in complimentary skills such as meeting facilitation, data management, project management, data modeling, and data management\nCompany Culture that celebrates “Foster the culture of we”, “Act with integrity” and “Drive towards excellence” while having fun at work\nTraining and certification opportunities to support your career now and after Logic20/20\nVarious opportunities to give back to the community through company sponsored events\nRequired Qualifications\nExperience building machine learning models using Python\nExperience deploying machine learning models in a production environment\nStrong knowledge of probability statistics\nExperience with Tensorflow or PyTorch\nExperience writing SQL to query databases, structure and modify data\nDemonstrated ability to frame business problems as statistical problems and solve them\nAbility to work both independently and as part of a team\nExperience working in ambiguous and dynamic environments that move quickly\nAn undergraduate degree in mathematics, computer science, or engineering is preferred\nPreferred Qualifications\nPassion and experience driving adoption of machine learning in industry\nExperience deploying machine learning on large scales through Spark or other big data technology\nExperience building systems in AWS\nExperience in computer vision with deep neural networks\nExperience with leading workshops with executives to drive requirements gathering\nMasters or PhD in data science or related field\n\nAbout Logic20/20. . .\n\n\nLogic20/20 is one of Seattle’s fastest growing full-service consulting firms. Our core competency is creating simplicity and efficiency in complex solutions. Although we make it look like magic, we succeed by combining methodical and structured approaches with our substantial experience to design elegant solutions for even the most intricate challenges. Our rapid growth is in response to our ability to deliver consistently for our clients, which is directly related to the quality of the people we hire.\n\nThe past four years, we’ve been in the top 10 “Best Companies to Work For” ….. why? Our team members are highly self-motivated, comfortable conceiving strategies on the fly, and enjoy working both individually and as part of a team. Our environment is very high-energy and demanding, and individuals with remarkable enthusiasm and a can-do attitude are joining our team. We have lots of fun, focus on our employees and our clients, and work to bring our best to every opportunity.3.8Logic20/20\n3.8San Jose, CASeattle, WA201 - 5002005Company - PrivateConsultingBusiness Services$25 to $50 million (USD)-10086142114.0Logic20/20CA161111100101000000data scientistnaM
88Research Scientist$38K-$84K (Glassdoor est.)SUMMARY\n\nThe Research Scientist I will be tasked with oversight of research in the Division of Cancer Biology Research at the Rochester General Hospital Research Institute.\n\nA strong background in Molecular Biology or Cancer Biology Research is preferred. Mouse models will be used in the research.\n\nSTATUS: Full Time\n\nLOCATION: RGH Research Institute\n\nDEPARTMENT: Cancer Biology\n\nSCHEDULE: Monday-Friday; Days\n\nATTRIBUTES\nMD or PhD who is not self supporting of their own salary nor has their own research program\nFunctions with minimal direction from Research Scientist II, Senior Research Scientist or Laboratory Director.\nStrong analytical, computer, leadership and problem-solving skills\nRESPONSIBILITIES\nConducts research projects including complex experiments, some in parallel, utilizing current concepts and recognized standard techniques, developing new protocols as necessary\nDemonstrates a high level of initiative in performing experiments, analyzing data and drawing conclusions regarding progress and results of work.\nMaintains a familiarity with current and emerging technologies through reading and understanding scientific and technical literature resulting in a broadening understanding of disciplines outside area of training and enabling the use of new and improved procedures in the laboratory.\nDuties are performed with an understanding of drug discovery in area of specialization.\nEDUCATION PhD; MD Rochester Regional Health is an Equal Opportunity / Affirmative Action Employer. Minority/Female/Disability/Veteran3.3Rochester Regional Health\n3.3Rochester, NYRochester, NY10000+2014HospitalHealth Care Services & HospitalsHealth Care$500 million to $1 billion (USD)-100388461.0Rochester Regional HealthNY70000000000000000other scientistnaP
99Data Scientist$120K-$160K (Glassdoor est.)isn’t your usual company. Our work is powered by the premise that every person at is unique, possessing a distinct set of skills, personality, and passions. We embrace our collective talents to tackle technical challenges, refine our successfully disruptive business ideas, and co-create one of the most human and inspiring work cultures out there. We are a team of collaborators, valuing and rewarding shared success over individual heroics.\nAs a member of our Data Science team, you will use your quantitative expertise to identify new areas of research and optimization, and then see those ideas through to production. Data Science is a fundamental contributor to Intent’s success - your work will have a direct and tangible impact on the business.\nThere are no typical projects, but a workflow might involve performing research and analysis against petabytes of historical data using our collection of large-scale analytics tools like Spark, Snowflake, and RedShift, building prototypes using mostly Scala or another functional language, pairing with engineers on the Modeling and Prediction team to harden and deploy the functionality, and running live tests to monitor the results.\nAll of these steps take place in an environment of respect and collaboration, and the Data Science team is empowered to own its agile processes. Every member of the team is expected to be both a student and teacher, and we believe that the most effective Data Science team is one that is collectively learning and growing. Experience in coaching and mentoring colleagues at all levels is strongly desired. As part of the Data Science team, you’d help build out a real-time predictive analytics platform that makes decisions for some of the largest sites on the web.\nAbout You:\nSignificant industry experience in several of the following areas: personalized experiences, big data analytics, implementing machine learning & statistical methods, designing and running A/B tests, product design and life cycle, writing production code, designing online auctions.\nExperience in user experience customization a plus\nExperience coaching and mentoring team members\nExperience writing production software in languages like Scala, Clojure, Java, Python, or C++ in an agile, collaborative environment\nExperience with handling large amounts of data (TB+) in a production setting\nExperience with Spark is a significant plus\nExperience in ad-tech a plus\nAbout Us:\nis the data science company for the world’s leading online commerce and travel brands. Our Predictive Intelligence Platform uses patented technology to predict user behavior in real-time and identify the future value of every user. Over 450 innovative brands from more than 40 countries trust Intent’s real-time predictions to deliver personalized user experiences that maximize utility and ROI.\nOur team is over 100 people and our offices span globally. We’re headquartered in NYC with locations in London, Kuala Lumpur, and Sao Paulo.\nEvery day, we’re inspired by two pursuits. First, we’re building novel products that are upending e-commerce. Second, we’re building the company we’ve always wanted to work for — one that’s open, human and collaborative, where very smart people come together to share ideas and get things done. We’re included on Built in NYC's Best Places to Work list and have been on Crain’s 100 Best Places to Work in NYC list for seven years running.\nLove Your Job!\nOur employees enjoy coming to work, and we let them know they're valued.\nOur vibrant team accomplishes a lot every day, but we insist upon work/life balance so things never become stale. We don’t take ourselves too seriously, but we take our work very seriously.\nWe believe that in order for our employees to perform their best, they need access to strategic decisions, and so our flat structure and open communication invite innovation from all levels — ideas flow freely.\nWe offer competitive compensation, stock options, and great perks & benefits, including:\nUnlimited vacation\nA generous parental leave policy\nA beautiful, dog-friendly office in SoHo with drinks and snacks\nAn open environment with lots of natural light and roof deck access\nAnnual $2,000 learning budget and Citi Bike membership\nAccess to Fond, our employee perks program featuring deals and discounts on hundreds of products and services\nAccess to Sherpaa, a telehealth service with 24/7\nIn-office yoga classes\nCompany-wide social events, and more!\nSo what are you waiting for? Apply with your resume in just a few clicks!\nAbout Us\nOur Products\nOur Dogs\nTwitter\nInstagram4.6<intent>\n4.6New York, NYNew York, NY51 - 2002009Company - PrivateInternetInformation Technology$100 to $500 million (USD)Clicktripz, SmarterTravel00120160140.0<intent>NY121100000000000000data scientistnana

Last rows

df_indexJob TitleSalary EstimateJob DescriptionRatingCompany NameLocationHeadquartersSizeFoundedType of ownershipIndustrySectorRevenueCompetitorsHourlyEmployer providedLower SalaryUpper SalaryAvg Salary(K)company_txtJob LocationAgePythonsparkawsexcelsqlsaskeraspytorchscikittensorhadooptableaubiflinkmongogoogle_anjob_title_simseniority_by_titleDegree
732945Machine Learning Engineer (NLP)$80K-$142K (Glassdoor est.)CK-12’s mission is to provide free access to open-source content and technology tools that empower students as well as teachers to enhance and experiment with different learning styles, resources, levels of competence, and circumstances.\n\nTo achieve this noble and ambitious vision, we at CK-12 are challenging the traditional model of education to transform it dramatically. Technology has opened up lots of opportunities to revolutionize education for the benefit of students, teachers, and parents.\n\nWe have chosen to be non-profit so that we can effectively realize our mission and so that we can do the right thing! It also provides us the ability to experiment with big and bold ideas. CK-12 is backed by Vinod Khosla, a renowned technology venture capitalist.\n\nAt CK-12, you’ll experience the benefits of working in a dynamic, entrepreneurial, innovative and non-bureaucratic environment where you will get a lot of cool things done than you ever imagined! We are a small group of passionate folks who are determined to disrupt the current form of education. We came together from companies such as Apple, eBay, Amazon, McGraw-Hill, and startups.\n\nTechnology is key to scale education and we deeply believe in it. Come develop great solutions on our cloud-based (AWS) AI-first platform delivering rich and interactive content.\n\nDoes our mission, people and technologies excite you? If the answer is YES! and you are a great technologist who will challenge status-quo (no order takers please!) by innovating, please come join us! Together, we will change the world!\n\nCORE RESPONSIBILITIES\nAnalyze textual content and apply NLP to build\nQuestion and Answering System\nNatural Language Generation\nContent Summarization\nLearning Chatbot\nML Assisted Grading System\nExtract and analyze large volumes of data deeply to understand and deduce a wide range of information about CK-12 students, teachers based on their usage history\nApply Machine learning algorithms to\nDiscover patterns in usage\nPredict users behavior\nIdentify student knowledge gaps and misconceptions\nExtract knowledge from CK-12 content using deep learning\nEnvision, experiment, build (or discard), and deliver ML products that can disrupt the Edtech space\nHave fun while driving innovation through ML by challenging the status quo in education and learning and providing creative ML-based solutions\nREQUIREMENTS\nBachelor’s or higher degree in a quantitative discipline (Computer Science or equivalent) or equivalent work experience\nHands-on developer with 3+ years of experience and excellent programming skills (Python is a strong plus)\n3+ years of experience in NLP\nExperience with recent developments in Deep Learning-based NLP\nExperience with a combination of the following:\nQuestion Answering\nKnowledge Graph\nDialog Systems/Conversational systems\nMachine Translation\nNatural Language Generation\nText Summarization\nExperience in building scalable production services\nSkills: Python, TensorFlow, PyTorch, MXNet\nCapacity to handle multiple tasks and prioritize effectively\nAble to translate high-level directions and open-ended questions into practical projects and lead/drive their completion with minimal supervision\nEnvision what ML can do for education\nHOW TO APPLY\n\nSubmit your resume to ml@ck12.org with “Machine Learning Engineer (NLP)” in the subject line.\nIt is a full-time position at our office in Palo Alto, CA (no telecommuting)\nThe applicant must be authorized to work in the US for any employer4.1CK-12 Foundation\n4.1Palo Alto, CAPalo Alto, CA1 - 502007Company - PrivateK-12 EducationEducationUnknown / Non-Applicable-10080142111.0CK-12 FoundationCA141011000101000000machine learning engineernana
733946Senior Data Analyst$99K-$178K (Glassdoor est.)Senior Data Analyst\n\nAbout us\n\n\nLife360 brings families closer with smart tools designed to protect and connect the people who matter most.\n\nKnown for first-to-market solutions for modern family challenges, Life360 recently reached #1 in Apple's US App Store's list of free social networking apps. Nearly 1 in 10 US families with kids use Life360 an average of 12 times a day, and global membership is growing exponentially, with over 25 million monthly active users in over 140 countries making Life360 the largest mobile service for families in the world.\n\nThis reach gives us the opportunity to do unprecedented good for families through our valued core offerings: advanced location sharing, private messaging, driver monitoring, help alerts, 24/7 roadside assistance, and Crash Detection with emergency response. On average we respond to 1,000 roadside assists and dispatch 200+ ambulances each month to those in need.\n\nOffering both free and paid memberships. In addition, the company has raised over $200 million in equity financing, and recently completed an IPO on the ASX exchange giving our employees the liquidity of a public company with the upside of a private growth stage business.\n\nLife360's rapidly growing team of 150+ employees is headquartered in San Francisco, with offices in San Diego, and Las Vegas.\n\nAbout the Job\n\n\nData plays a crucial role in Life360's growth by driving smarter decisions, improving operations, and creating higher value user experiences. As an analytics team, we partner with a wide variety of cross-functional partners to apply data insights against strategic initiatives. "Know Our Users" is a Life360 core value and we're looking for analytics professionals who are passionate about leveraging user data to create value for Life360 families.\n\nYou'll be working in a dynamic growth environment, leading efforts to better understand the business, the product, and the customer. Life360 has one of the most interesting datasets in the world: location, driving, product usage, and purchasing behavioral data - all centered around who matters most, the family. If you have a passion for making an impact and working on products that help millions of families around the world, then this is the right place for you.\n\nResponsibilities\n\n\nAnalytics team members work closely with specific strategic teams but also have opportunities to work on company-wide initiatives. This person is expected to focus on that particular area but also generalize their skills towards other parts of the business with a variety of projects.\n\nIn this role, we are looking for someone to partner with the Revenue team in developing actionable insights from both product and financial perspectives. Common projects range from financial disclosure reports that tell Life360's growth story to conducting deep-dive analyses into identifying opportunities for subscription growth. Ultimately, you will be tasked with finding data insights that deliver business value.\n\nThese are some typical responsibilities:\nLeverage data to understand the Life360 family and their product usage, developing insights that apply to product, marketing, and business strategy.\nPartner with executives, product managers, engineers, marketers, designers to translate data insights into smarter decisions and applications.\nEstablish and manage KPIs that measure the health of the business, product performance, and customer experience quality.\nBuild dashboards and reporting processes to monitor business and product trends.\nDevelop frameworks, tools, and best practices to apply data insights towards business questions.\nConduct analyses and build models that identify opportunities and drive growth.\nDesign and analyze experiments, communicate results, and drive decisions.\nPotential projects may include forecasting business performance, developing family driving profiles, and predicting customer lifetime value.\nQualifications\n\n\nWe are looking for candidates with a diverse background that will compliment the skills and backgrounds of the current team. If you don't fit all the criteria below please apply anyway as this list is more of a preference rather than a rule. Our priority is for a well rounded team that delivers results.\nWe are looking for candidates who have had previous experience on analytics teams and are willing to help coach and mentor colleagues on best data practices. 5+ years is preferred.\nDegree in a quantitative field like statistics, economics, applied math, operations research, or engineering, finance, business intelligence. Advanced degrees are preferred.\nSQL expertise - able to write structured and efficient queries on large datasets.\nExperience in scripting languages, like analysis and visualization libraries in Python or R.\nStrong verbal/written communication skills and the ability to collaborate with cross-functional partners to build the business.\nProficiency in building data visualizations and interactive dashboards with tools like Tableau.\nExperience designing and evaluating experiments to draw inferential recommendations.\nCuriosity to learn about new topics and uncover hidden insights.\nPerks\nFridays are Work From Home days at Life360\nCompetitive pay and benefits\nFree snacks, drinks (three ways to brew your favorite cup of coffee), and food in the office\nCatered lunches throughout the week\nHealth, dental and vision insurance plans\n401k plan\n$200/month Quality of Life perk\nA great office with plenty of light in the heart of the SOMA district in beautiful San Francisco\nWhatever makes you stronger makes us stronger. We buy you the things you need to improve yourself and get your job done.\nThis position is located in San Francisco, CA. It is not a remote role.3.9Life360\n3.9San Francisco, CASan Francisco, CA51 - 2002008Company - PublicComputer Hardware & SoftwareInformation TechnologyUnknown / Non-Applicable-10099178138.5Life360CA131000100000010000analystsrna
734947Data Science Project Manager$37K-$100K (Glassdoor est.)At MassMutual, we are passionate about helping millions of people find financial freedom and this passion has driven our approach to developing meaningful experiences for our customers. The Data Science team, part of the Enterprise Technology and Experience organization, is comprised of highly skilled and collaborative problem solvers who are motivated to create innovative solutions that exceed the changing needs of our customers and move MassMutual and the industry forward.\n\nTo continue our cutting-edge work, we are hiring a Data Science Project Manager to join our team.\n\nWhat great looks like for this role\n\nA seasoned Project Manager will have the opportunity to apply advanced project and program management knowledge, skills, tools and techniques to project deliverables, processes, communications and presentations in order to meet or exceed stakeholder needs and expectations. The Project Manager will have the ability to think strategically to understand, apply, promote and contribute to MassMutual's delivery methodologies, standards and tools. This individual will work with a team that embraces diversity in all of its forms, respects others and looks to have fun.\n\nObjectives of this role\nTo scale our data science impact.\nTo impact complex business goals through the delivery of quality work timely.\nTo ensure documentation is in place and process is followed meeting standards\nDaily and Monthly Responsibilities - What You Will Do:\nLead broad scope projects that have medium to long-term focus\nEngage with all levels across the enterprise\nServe as a conduit of knowledge between functional and technical teams\nCommunicate regularly with individuals both within and outside of our team, managing relationships and expectations\nNavigate ambiguity to deliver results\nDevelop plans for continuous service to support implementation of products\nAct as a champion for data science capabilities by communicating their benefits and how they can be implemented\nProvide consultation, business analysis, project management, and leadership on multiple projects of varying duration, size, and complexity\nMotivate teams to work together, communicate, and deliver\nElicit, translate and simplify requirements\nDocument and organize acceptance criteria for user stories\nManage budget, timeline, and scope throughout the course of all assigned projects\nLead project teams during all phases of the development life cycle including requirements gathering and analysis, design, build, pilot, implementation and continuous service\nFacilitate client and project team interactions including: scrums, sprint planning, sprint retrospectives, sprint reviews, incident management and release management\nWork with product managers to define improvements to business processes, assist decision-makers in gathering information to make decisions, and help quality assurance test solutions\nWork with technical leads, product managers to plan, develop technical scopes of work and manage the execution of projects/product changes in response to requirements from our stakeholders\nBe self-supportive in collaborating with peers to effectively deliver a robust solution for the business\nDrive process within a matrix management setting\nWhat You Will Not Do:\nDesign strategic roadmaps\nLarge amounts of computer programming\nManipulation of large data sets\nSit in solitude at your desk\nBasic Qualifications\nBachelors Degree preferably in Business/Finance or an analytical field such as Economics, Mathematics, Engineering, Computer Science\n4+ years managing and driving the execution of complex projects\nExperience in/working in partnership with a technical role, such as an engineer, developer, data scientist, etc. a plus\nProficient with project management tools and techniques, such as JIRA, Confluence, Scrum and Kanban\nExcellent interpersonal communication, conflict management, coordination, and planning skills with cross-functional teams\nSkilled in applying judgment to balance process compliance with achievement of business objectives\nProject leadership experience focused on engaging others in the delivery and execution of technical solutions and service deliverables\nAbility to assess a project's scope and the team's ability to execute\nOutcome oriented with the ability to drill down from the big picture to process details\nAbility to communicate objectives, plans, status and results clearly\nStrong leadership skills and influencer\nAbility to collaborate across diverse teams and organizations\nStrong organizational skills and detail oriented\nAuthorized to work in the United States without requiring visa sponsorship now or in the future\nPreferred Qualifications\nMasters Degree, preferably in Business/Finance or an analytical field such as Economics, Mathematics, Engineering, Computer Science\nAgile certification or experience\nSolid grasp of software technologies and stacks.\nFormer technical experience is preferred, such as working with data science teams or experience developing and/or deploying predictive models3.6MassMutual\n3.6Boston, MASpringfield, MA5001 - 100001851Company - PrivateInsurance CarriersInsurance$10+ billion (USD)-1003710068.5MassMutualMA1700001000000000000data scientistnaM
735948Data Engineer$62K-$113K (Glassdoor est.)Do you find data architecture exciting? Does building a new data pipeline or optimizing a data warehouse make you happy? Can you migrate a data store to the cloud, run a few NLP algorithms to clean things up, and build a set of processes to keep the data current? Are you comfortable with Terabyte-scale data, optimizing cloud stores, building workflow management systems, AWS, and Python scripting? Can you work closely with business stakeholders to understand their needs and sate those through data solutions? If so, we want you!\n\nFivestars is seeking a Senior Data Engineer. Reporting to the Director of Analytics and Data Science, you will work with the Product, Marketing, and Engineering teams at Fivestars to build and maintain world-class data infrastructure.\n\nAt Fivestars, our mission is to help businesses and communities thrive by turning every transaction into a relationship. Over 50 million people use Fivestars to get rewarded at more than 14,000 local businesses with one rewards program. Local businesses use Fivestars to bring more customers into their stores with an all-in-one marketing and payments program. Fivestars drives over $3 billion in local commerce across its network per year.\n\nFivestars was launched out of Y-Combinator in 2011 (most recently on Y-Combinator's Top 75 Companies List for 2019) and has raised over $105 million from notable investors including Lightspeed, DCM, HarbourVest, Menlo Ventures, Y-Combinator, and others. Together, let's love local!\n\nResponsibilities\nBuild and maintain data infrastructure (Redshift/Presto/Kinesis/Glue/EC2/S3/etc.)\nCreate data pipelines to/from external partners using Python and other tools\nUse NLP to clean and consolidate data\nEstablish and use workflow-management tools to orchestrate solutions\nMonitor and improve pipeline and data-warehouse performance\nSkills\nSQL – write sophisticated and optimized queries against large databases\nPython – create efficient and scalable pipelines and solutions\nBusiness Acumen – understand the questions we are trying to answer through data\nProblem Solving – apply structured methods to analyze problems and develop solutions\nCommunication – explain technical concepts clearly and concisely\nRelationships – influence adoption of infrastructure through partnership\nQualifications/Experience\nUndergraduate degree in a highly technical field (e.g. Computer Science, Electrical Engineering, etc.) from a top-tier university\nGraduate degree (MS, PhD, etc.) in a similar field will be highly valued but is not required\n1+ years of experience in a data-engineering function using cloud-based infrastructure\nAbility to solve technical problems and create efficient, robust, and scalable solutions\nDemonstrated intellectual curiosity\nPerks\nPre-IPO stock options\nExcellent medical, dental, and vision coverage\nGreat downtown-SF office location\n4 weeks PTO + 11 paid-holidays per year\nThree in-office lunches per week and a fully-stocked kitchen with fruit, (healthy) snacks, coffee, and drinks\nTeam happy hours and company-sponsored events\nWellness Benefit - $500 per year to spend on eligible physical or mental well being\nFSA; short-/long-term disability coverage; life Insurance; 401K; EAP; and commuter benefits\nFivestars provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics. In addition to federal law requirements, Fivestars complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.3.9Fivestars\n3.9San Francisco, CASan Francisco, CA201 - 5002011Company - PrivateInternetInformation Technology$100 to $500 million (USD)Belly, SpotOn006211387.5FivestarsCA101011100000000000data engineernaP
736949Principal, Data Science - Advanced Analytics$86K-$137K (Glassdoor est.)IQVIA is the leading human data science company focused on helping healthcare clients find unparalleled insights and better solutions for patients. Formed through the merger of IMS Health and Quintiles, IQVIA offers a broad range of solutions that harness the power of healthcare data, domain expertise, transformative technology, and advanced analytics to drive healthcare forward.\n\nJob Description\n\nThe IQVIA Advanced Analytics team is one of the leading healthcare analytical teams in the world. Joining the AA team provides the opportunity to work with extremely complex data and methodologies in a fast-paced, ever-changing environment. We seek highly motivated people who truly want to make a difference in the life sciences industry. At IQVIA, we look for the very best people, and then give them meaningful work to do. we dont simply think about careers, we think about contributions.\n\nAdvanced Analytics - with departments in Philadelphia, Frankfurt, Paris, and Warsaw as well as a network of over 150 team members worldwide - is the global competence center for data science at IQVIA. Complex advanced analysis at the highest level are conceptualized and implemented to support international customers in the pharmaceutical industry - often within multinational projects. As a member of our team you can expect exciting international projects with interesting development perspectives.\n\nThe position will use large data sets to find opportunities for product and process optimization and models to test the effectiveness of different courses of action. Our data scientists have strong experience using a variety of data mining/data analysis methods, building and implementing models, using/creating algorithms and simulations. For this position, we are seeking several years of direct experience with developing algorithms and models to solve prediction problems. Awareness of various techniques available to use in predictive analytics. Using their proven ability to drive business results with their data-based insights, they will comfortably interact and work with a wide range of stakeholders and functional teams. They have a passion for discovering solutions hidden in large data sets and working with stakeholders to improve business outcomes.\n\nWhat were looking for:\nQuantitative background with advanced degrees (Master, PhD preferred) in Statistics, computer science, engineering, informatics, data science, or related field.\nIn-depth understanding of machine learning algorithms and statistical models\nAbility to manage, lead and communicate\nExperience in pharmaceutical or hospital/healthcare industry\nWhat youll be doing:\nBuild machine learning/statistical models and pipelines for solving predictive analytic tasks with electronic healthcare claims and medical records\nApply machine learning, data mining technologies in developing innovative solutions in pharmaceutical industry.\nParticipate at client meetings for complex proposals to present IQVIA advanced analytic methodologies to clients and to bring credibility for IQVIA team\nEnsure data quality throughout all stages of acquisition and processing, including such areas as data collection, normalization, transformation, embedding, visualization, etc.\nPresent study findings to clients and translate analytic outputs to business impact and recommend actions to clients to improve their business performance\nEnsure data quality throughout all stages of acquisition and processing, including such areas as data collection, normalization, transformation, embedding, visualization, etc.\nWork with IQVIA technology team to support machine-learning algorithms in big data platform to solve a variety of business problems.\nIQVIA is an EEO Employer - Minorities/Females/Protected Veterans/Disabled\n\nWe know that meaningful results require not only the right approach but also the right people. Regardless of your role, we invite you to reimagine healthcare with us. You will have the opportunity to play an important part in helping our clients drive healthcare forward and ultimately improve human health outcomes.\n\nWhatever your career goals, we are here to ensure you get there!\n\nWe invite you to join IQVIA.\n\nJoin Us\n\nMaking a positive impact on human health takes insight, curiosity, and intellectual courage. It takes brave minds, pushing the boundaries to transform healthcare. Regardless of your role, you will have the opportunity to play an important part in helping our clients drive healthcare forward and ultimately improve outcomes for patients.\n\nForge a career with greater purpose, make an impact, and never stop learning.\n\nIQVIA is an EEO Employer - Minorities/Females/Protected Veterans/Disabled\n\nIQVIA, Inc. provides reasonable accommodations for applicants with disabilities. Applicants who require reasonable accommodation to submit an application for employment or otherwise participate in the application process should contact IQVIAs Talent Acquisition team at workday_recruiting@iqvia.com to arrange for such an accommodation.3.6IQVIA\n3.6Plymouth Meeting, PADurham, NC10000+2017Company - PublicBiotech & PharmaceuticalsBiotech & Pharmaceuticals$2 to $5 billion (USD)PPD, INC Research, PRA Health Sciences0086137111.5IQVIAPA40000000000000000data scientistsrM
737950Sr Scientist, Immuno-Oncology - Oncology$58K-$111K (Glassdoor est.)Site Name: USA - Massachusetts - Cambridge\nPosted Date: Mar 24 2020\n\nAre you energized by a challenging role in immuno-oncology, where scientific demand is driving team growth? If so, this Senior Scientist would be a great opportunity to consider.\n\nThe Immune Biology Group within GSKs Immuno-Oncology & Combinations Research Unit (IOC RU) is seeking a Sr. Scientist with experience in immuno-oncology or immunology to join our team.\n\nIn this role, you will be responsible for conducting research designed to identify and validate immune-based therapies for cancer.\n\nThis Sr. Scientist role will provide you the opportunity to lead key activities to progress your career. Responsibilities include:\nDeliver critical path biology results to support GSKs pipeline of cancer immunotherapies from early discovery to first-time-in-human commitment.\nEstablish and expand internal wet lab capabilities at a growing GSK site.\nActively participate in building and maintaining drug discovery relationships with both internal stakeholders and external partners.\nWork within a dynamic and collaborative environment to deliver high-quality scientific data packages to meet experimental and organizational goals.\nWhy you?\nBasic Qualifications:\n\n\nWe are looking for professionals with these required skills to achieve our goals:\nBachelors or Masters degree in immunology, immuno-oncology or related field with 5+/3+ years of experience, respectively.\nStrong scientific background in immunology or immuno-oncology research, with a focus on bioassay development to functionally characterize biologics and/or small molecules.\nResearch expertise in the field of adaptive immunity with a focus on T cell biology with demonstrated ability to independently establish robust in vitro and ex vivo functional assay protocols to investigate mechanisms of action for multiple drug candidates and their combinations.\nExpertise in high-dimensional flow cytometry to phenotypically characterize immune cells from human and murine tissue samples, including both surface and intracellular staining.\nDemonstrated hands-on ability to independently design, conduct, and analyze pharmacology studies.\nStrong communication skills and ability to conduct research in a cross-functional team environment.\nAbility to interpret data clearly and concisely both verbally and in documents and present results in an organized manner.\nAbility to prioritize, manage time efficiently, and implement creative solutions to meet program needs.\nCommitment to continual improvement by reading and applying the latest scientific literature, methodologies and technology where appropriate.\nA high level of integrity and desire to develop transformational medicines that bring benefit to patients\nPreferred Qualifications:\n\n\nIf you have the following characteristics, it would be a plus:\n2+ years pharmaceutical or biotechnology industry research experience working in matrixed drug discovery project teams.\nResearch expertise with functional characterization of myeloid cells\nExperience liaising with Laboratory Operations personnel.\nWhy GSK?\n\nOur values and expectations are at the heart of everything we do and form an important part of our culture. These include Patient focus, Transparency, Respect, Integrity along with Courage, Accountability, Development, and Teamwork. As GSK focuses on our values and expectations and a culture of innovation, performance, and trust, the successful candidate will demonstrate the following capabilities:\nOperating at pace and agile decision-making using evidence and applying judgement to balance pace, rigour and risk.\nCommitted to delivering high quality results, overcoming challenges, focusing on what matters, execution.\nContinuously looking for opportunities to learn, build skills and share learning.\nSustaining energy and well-being\nBuilding strong relationships and collaboration, honest and open conversations.\nBudgeting and cost-consciousness\n*LI-GSK\n\n*This is a job description to aide in the job posting, but does not include all job evaluation\n\nIf you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or +1 801 567 5155 (outside US).\n\nGSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.\n\nImportant notice to Employment businesses/ Agencies\n\nGSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.\n\nPlease note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSKs compliance to all federal and state US Transparency requirements. For more information, please visit GSKs Transparency Reporting For the Record site.3.9GSK\n3.9Cambridge, MABrentford, United Kingdom10000+1830Company - PublicBiotech & PharmaceuticalsBiotech & Pharmaceuticals$10+ billion (USD)Pfizer, AstraZeneca, Merck005811184.5GSKMA1910010000000000000other scientistsrM
738951Senior Data Engineer$72K-$133K (Glassdoor est.)THE CHALLENGE\nEventbrite has a world-class data repository of live events, powering millions of events and hundreds of millions of ticket transactions each year in 170+ countries. Our platform allows event creators and event goers to have the most meaningful live experiences. As a Senior Data Engineer, you will be part of a team that is building our next-gen big data infrastructure to support both internal and customer-facing applications.\nTHE TEAM\nWe're a people-focused Engineering organization: our people value working together in small teams to solve significant problems, supporting an active culture of mentorship and inclusion, and pushing themselves to learn new things daily. Pair programming, weekly demos, tech talks, and quarterly hackathons are at the core of how we’ve built our team and product. We believe in engaging with the community, regularly hosting free events with some of the top technical speakers, and actively contributing to open source software (check out Britecharts as an example!). Our technology spans the web, mobile, API, Big Data, machine learning, search, physical point of sale, scanning systems, and the data infrastructure required to support those systems.\nTHE ROLE\nWe are hiring a Senior Data Engineer to help us build a scalable, reliable, secure, and highly performant data platform. You'll help reinforce and extend the infrastructure that powers the use of data at Eventbrite. From infrastructure development to data analysis to ETL jobs, you will need a broad range of big data engineering skills. The team has strong and versatile engineers. You will grow. We hope to grow with you.\nTHE SKILL SET\n8-10 years of experience building high quality software in Python, Java, or Scala\n5+ years of experience designing batch, streaming, and event-driven Data Warehouse and ETL architectures with Hadoop ecosystem, such as Spark, Hive, Storm, Presto, Kafka, Hbase, MySQL databases, and HDFS\nUnderstanding of Data Engineering, Data Science, Machine Learning, Data Analytics, and the relevant technologies that support them\nDeep expertise in cloud computing, preferably AWS, security, cluster sizing, and performance tuning. Ability to setup process and systems to monitor and reduce cloud computing costs for a large organization\nExperience building systems to instrument, collect and process billions of events, such as clickstream data. Deep understanding of measuring and ensuring data quality at scale\nOutstanding verbal, written, presentation, and facilitation skills. In particular, a demonstrated ability to effectively communicate technical and business issues and solutions to multiple organizational levels\nAbility to teach and mentor engineers with a variety of skill levels and backgrounds\nVision to define the future of how Big Data and Analytics intersect at Eventbrite. The Analytics community at Eventbrite will rely on you to build and maintain a data environment built for speed, accuracy, consistency and uptime\nSkills to support analytics by building a world-class data warehousing environment that empowers analysts to deliver insights to their stakeholders. Evaluate competing data technologies and tool­sets from various vendors and open-source products; drive platform selection; lead technical architecture, application design and implementation\nSkills to support analytics by building a world class data warehousing environment that empowers analysts to deliver insights to their stakeholders\nEvaluate competing data technologies and toolsets from various vendors and open-source products; drive platform selection; lead technical architecture, application design and implementation\nCombine strong analytical skills with the ability to collect, organize and analyze large amounts of information with attention to detail and accuracy\nPassionate about live entertainment, and eager to help build Eventbrite into the world's leading event technology platform\nStrong analytical and problem-solving skills and attention to detail\n\nBONUS POINTS\nFamiliarity with a server-side frameworks, such as Django, Express, Rails, or .Net\nSkilled in various forms of data modeling including ER, XML Schemas, SQL, logical and physical database design, dimensional modeling, and/or OLAP cubes\nKnowledge of database schemas and models, including 3NF, star schemas, cubes, etc. and in developing physical database schemas from logical models\nStrong knowledge of database optimization and scaling approaches including indexing, partitioning, sharding, clustering, in ­memory tables, horizontal and vertical scaling\nFamiliarity with managing large datasets and understanding the complexities of merging large databases, meeting security audit requirements, and implementing a data retention policies\n\nABOUT EVENTBRITE\nEventbrite is a global ticketing and event technology platform, powering millions of live experiences each year. We empower creators of events of all shapes and sizes – from music festivals, experiential yoga, political rallies to gaming competitions –– by providing them the tools and resources they need to seamlessly plan, promote, and produce live experiences around the world. Last year, the team served 795,000 creators hosting nearly 4 million experiences across 170 countries. Meet some of the Britelings that make it happen.\n\nIS THIS ROLE NOT AN EXACT FIT?\nSign up to keep in touch and we’ll let you know when we have new positions on our team.\n\n\nEventbrite is a proud equal opportunity/affirmative action employer supporting workforce diversity. We do not discriminate based upon race, ethnicity, ancestry, citizenship status, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), marital status, registered domestic partner status, caregiver status, sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, genetic information, military or veteran status, mental or physical disability, political affiliation, status as a victim of domestic violence, assault or stalking, or other applicable legally protected characteristics.\nApplicant Privacy Notice4.4Eventbrite\n4.4Nashville, TNSan Francisco, CA1001 - 50002006Company - PublicInternetInformation Technology$100 to $500 million (USD)See Tickets, TicketWeb, Vendini0072133102.5EventbriteTN151110100000100000data engineersrna
739952Project Scientist - Auton Lab, Robotics Institute$56K-$91K (Glassdoor est.)The Auton Lab at Carnegie Mellon University is a large academic group driven by a desire to make a real-world difference in a broad range of research interests. The areas of our current focus include, but are not limited to, modeling complex temporal and sequential data, structural learning, incorporating diverse feedback, interactive network science and human-machine interaction. We are always interested in finding ways to make Artificial Intelligence more accessible, beneficial and affordable to everyone. The areas of our current application interests include healthcare in clinical, managerial, and new sensing modalities contexts, radiation safety, countering human trafficking, agriculture, predictive maintenance of equipment, multi-modal data analytics, etc.\n\nWe are seeking a Project Scientist to join us in the Auton Lab. In this role, you will act as a team leader for specific areas of research projects in applied data science. Working with principal investigator(s), you will prioritize project goals based on overall organizational goals. You will contribute significantly in the development and documentation of research finding and as a major collaborator of scientific papers. There will be frequent opportunities to present research finding to current or potential sponsors and at major national and international conferences.\n\nCore responsibilities will include:\nPreparing data, developing models, and producing research findings\nContributing to project management and maintenance of customer relationships\nDocumenting research findings, producing reports and synthetic summaries\nContributing to scientific publications\nWorking with principal investigator(s) to formulate research goals and plans\nPreparing and delivering presentation of research findings\nQualifications:\nPhD in machine learning, applied mathematics, statistics, computer science, or other relevant field or equivalent combination of training and experience preferred\n10-15 years of Research Experience required\nProven technical background\nExperience in analyzing of data at scale, proven hands-on model development\nFlexibility, excellence, and passion are vital qualities within Auton Lab. Inclusion, collaboration and cultural sensitivity are valued proficiencies at CMU. Therefore, we are in search of a team member who is able to effectively interact with a varied population of internal and external partners at a high level of integrity. We are especially interested in qualified candidates who can contribute through their work/life experiences to the diversity and excellence of the academic community.\n\nYou should demonstrate:\nExcellent communication skills\nAbility to work optimally in a team\nAre you interested in this opportunity with us? Please apply.\n\nMore Information:\n\nPlease visit “Why Carnegie Mellon” to learn more about becoming part of an institution inspiring innovations that change the world.\n\nA listing of employee benefits is available at: www.cmu.edu/jobs/benefits-at-a-glance/.\n\nCarnegie Mellon University is an Equal Opportunity Employer/Disability/Veteran.2.6Software Engineering Institute\n2.6Pittsburgh, PAPittsburgh, PA501 - 10001984College / UniversityColleges & UniversitiesEducationUnknown / Non-Applicable-100569173.5Software Engineering InstitutePA370001000000000000other scientistnaP
740953Data Science Manager$95K-$160K (Glassdoor est.)Data Science ManagerResponsibilities:\n\nOversee a team of Data Scientists and Data Visualization Analysts who transform enterprise data into value drive insights\n\nDesign and implement processes for complex large-scale datasets for data mining, predictive modeling, and research purposes\n\nServe as an advisor for business stakeholders identifying data needs and explaining the importance and use of data applicable to their usage\n\nOversee development of a style guide detailing best practices standards for data visualization\n\nManage the intake process of analytics projects, measure value, and prioritize projects\n\nAlign the department as a customer-oriented service providing insights and information\n\nCoach and mentor team providing specific, timely and constructive feedback\n\nProvide day-to-day leadership and operational management in area of responsibility\n\nExecute objective, plans, and policies in line with enterprise level strategy\n\nProactively find new opportunities to leverage technology for continuous improvement and greater efficiency\n\nContribute to budget development and assist in preparation of operational plans for department\n\nOversee area of responsibility to adhere to approved budgets\n\nMS degree in a quantitative discipline plus a minimum of 5 years of professional work experience\n\nMinimum of 3 years of management experience\n\nProfessional work experience with R and advanced statistical modeling techniques including machine learning techniques\n\nExcellent oral and written communication skills\n\nExcitement, curiosity and passion for shaping the future through digital technology\n\nUS Citizenship or green card required3.2Numeric, LLC\n3.2Allentown, PAChadds Ford, PA1 - 50-1Company - PrivateStaffing & OutsourcingBusiness Services$5 to $10 million (USD)-10095160127.5Numeric, LLCPA-10001000000000000data scientistnana
741955Research Scientist – Security and Privacy$61K-$126K (Glassdoor est.)Returning Candidate? Log back in to the Career Portal and click on 'Job Browsing/History' and find the job you're looking for.\n\n2019-024-OIC: Research Scientist – Security and Privacy\n\nDirectorate Open Innovation Center\nLocation Beavercreek, OH\nIf you want help develop the future technology to ensure security and privacy, Riverside Research’s Trusted and Resilient Systems group is the place for you. We are searching for an individual to join our research group to help shape a more secure future. The team has ongoing research in security of machine learning, cryptography, hardware and hypervisor security solutions, as well as developing cutting edge solutions to the security of open architecture systems. The ideal person for this position is passionate about many diverse areas technology and can leverage their interests to develop and study creative solutions to some of the most difficult challenges. The current team resides in Riverside Research’s Beavercreek, OH, but we are willing to consider candidates that would prefer to work out of one of our Washington DC (Centerville or Crystal City) offices, our New York City office, or our Boston office.\n\nJob Responsibilities:\n•Work with a team of highly skilled researchers to develop interesting and novel solutions to security and privacy problems\n•Publish and present research in conferences and journals\n•Work with the team to identify future areas of research investment and develop research plans\n•Assist with writing technical proposals\n\nQualifications:\n•Ability to obtain and maintain TS/SCI security clearance\n•Bachelor's or Master's degree with significant experience in security privacy research\n•Prior experience developing software\n•Ability to work independently and with a team\n•Superior written and verbal communication skills\nDesired Qualifications:\n\n•Python\n•Web development (we use React)\n•Revision control (we use Git)\n•Machine learning\n•Cryptography\n•Prior experience with government funded research\n\nRiverside Research strives to be one of America’s premier providers of independent, trusted technical and scientific expertise. As we continue to add experienced, technically astute staff, we are looking for highly motivated, talented team members that can help our DoD and Intelligence Community (IC) customers continue delivery of world class programs. As a not-for-profit, technology-oriented Defense Company, we believe service to customers and support of our staff is our mission. Our goal is to serve as a destination company by providing an industry-leading, positive, and rewarding employee experience for all who join us. We aspire to be a valued partner to our customers and to earn their trust through our unwavering commitment to achieve timely, innovative, cost-effective and mission-focused solutions.\n\nAll positions at Riverside Research are subject to background investigations. Employment is contingent upon successful completion of a background investigation including criminal history and identity check.\n\nThis contractor and subcontractor shall abide by the requirements of 41 CFR 60-741.5(a). This regulation prohibits discrimination against qualified individuals on the basis of disability, and requires affirmative action by covered prime contractors and subcontractors to employ and advance in employment qualified individuals with disabilities.\n\nThis contractor and subcontractor shall abide by the requirements of 41 CFR 60-300.5(a). This regulation prohibits discrimination against qualified protected veterans, and requires affirmative action by covered contractors and subcontractors to employ and advance in employment qualified protected veterans.\n\nApply Now3.6Riverside Research Institute\n3.6Beavercreek, OHArlington, VA501 - 10001967Nonprofit OrganizationFederal AgenciesGovernment$50 to $100 million (USD)-1006112693.5Riverside Research InstituteOH541000000000000000other scientistnaM